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(57) Abstract: A polynucleotide 
comprising a nucleotide sequence 
which encodes the polypeptide 
whose Eimino acid sequence is 
given in Figure (lb) or Figure (11) 
or a fragment or variant or fusion 
thereof or a fusion of said fragment 
or variant, or its complement, 
the polypeptide encoded by said 
polynucleotide and fragments 
or variants or fusions thereof, 
antibodies specifically binding the 
said polypeptide or fragments or 
variants or fusions thereof. The 
invention further comprises methods 
of producing said polypeptide and 
antibodies and uses thereof. 
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NOVEL COMPOUNDS AND METHODS 



The present invention relates to novel compounds and methods and, in 
particular, it relates to a physiological substrate for PKD/PKC|i and 
5 methods of use. 

PKD/PKC|Li is an abundant and widespread protein kinase with unique 
domain composition and enzymatic properties, until now awaiting the 
identification of any physiological function(s) and cellular substrate(s). 

10 

Plasma membrane components play a crucial role in transducing 
extracellular signals into the cell. Stimulation of several plasma membrane 
receptors, by hormones, growth factors and cytokines, causes the rapid 
hydrolysis of phosphoinositides which results in an increase of the lipid- 

15 derived second messenger diacylglycerol (DAG; Exton, 1990). DAG causes 
the activation of protein kinase C (PKC; Nishizuka, 1992; Divecha and 
Irvine, 1995), which in turn affects several cellular processes including cell 
growth and differentiation, tumor promotion, changes in cell morphology, 
neuronal development and endocrine and exocrine secretion (Kikkawa and 

20 Nishizuka, 1986; Rozengurt, 1986; Herschmann, 1991; Nishizuka, 1992). 
PKC isoforms are divided into three major groups on the basis of their 
primary structure and biochemical properties (Newton, 1997; Mellor and 
Parker, 1998; Ron and Kazanietz, 1999). The conventional PKCs (a, pi, p2 
and y) contain a regulatory cysteine-rich domain that binds DAG and 

25 phorbol esters, and a calcium binding domain, responsible for the calcium- 
dependent modulation of their kinase activity. The novel PKCs (5, e, r| and 
t) are sensitive to DAG or phorbol esters, but are calcium independent. The 
third group is known as the atypical PKCs and A,/i), which are 
unresponsive to calcium, DAG or phorbol esters. 
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Protein Kinase D (PKD, also known as PKCjx) is a serine/threonine kinase 
distantly related to the PKC family, that contains a conserved DAG/phorbol 
ester-binding cysteine-rich domain like the classical and novel PKC 
5 isoforms, but is characterised by unique en2ymological and structural 
properties (Johannes et ah, 1994; Valverde et aL^ 1994; Rozengurt et ah, 
1995; Iglesias et aL, 1998a). Unlike all other PKCs, the amino-terminal 
region of PKD lacks the typical auto-inhibitory pseudo-substrate site and 
contains a pleckstrin homology (PH) domain, unique within the PKC 

10 family, that regulates its activity (Iglesias and Rozengurt, 1998b). In intact 
cells, DAG, phorbol esters, mitogenic neuropeptides and growth factors 
induce activation of PKD through a PKC-dependent signalling pathway 
(Van Lint et ah, 1995; Zugaza et ah, 1996; Zugaza et ah, 1997), involving 
the phosphoiylation of two activation loop sites in the carboxy-terminal 

15 catalytic domain of PKD (Iglesias et ah, 1998c; Waldron et ah, 1999). The 
increased activity of phosphorylated PKD is maintained during cell 
disruption and immunoprecipitation. This characteristic further 
distinguishes PKD from other PKC members, whose in vitro activity is 
strictly dependent on the presence of lipid cofactors (Zugaza et ah, 1996). 

20 All these unique features and the divergence of the catalytic domain of PKD 
from the highly conserved kinase sub-domains of the other PKC 
isoenzymes, suggests the involvement of PKD in distinct cellular 
ftinction(s). 

25 Selective protein-protein interaction and/or the phosphorylation of unique 
downstream effectors are likely to be the basis of isotype-specific functions 
in vivo. Each PKC family member shows an optimal substrate consensus 
sequence and PKD, in particular, has a substrate specificity very different 
from other PKC isoforms (Nishikawa et ah, 1997). Consistent vvith this, it 
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has been reported previously that PKD failed to phosphoiylate several 
substrates which are actively phosphorylated by other members of the PKC 
family (Valverde et al, 1994; Van Lint et al, 1995). The substrates for 
PKD known so far are two peptides derived from glycogen synthase, the 
synthetic peptides syntide-2 and GS (Valverde et aL, 1994; Dieterich et al., 
1996), Several proteins, such as phosphatidylinositoHdnases (Nishikawa et 
al., 1998), 14.3.3 proteins (Hausser et al., 1999), B-cell receptor (BCR) 
complex, the tyrosine kinase Syk, the phospholipase Cyl (PLCyl; 
Sidorenko et al., 1996) and Bruton's tyrosine kinase (Btk; Johannes et aL, 
1999) have been reported to interact with PKD. Recently, we demonstrated 
the direct interaction of PKD witli two members of the novel PKCs, PKCs 
and PKCt] (Waldron et al, 1999), However, none of these interacting 
proteins have been shown to be PKD substrates in vivo. These unique 
stmctural and functional features of PKD suggest that this eirzyme may be 
involved in a still poorly understood signalling pathway with distinct 
cellular downstream effectors. 

To gather insights into PKD cellular function(s), we have used specific anti- 
PKD/PKC^i antibodies to identify putative partners of PKD. 

Following this strategy has surprisingly led us to purify, clone and 
characterise a novel PKD-interacting protein, which we have called 
Kidins220, that co-immunoprecipitates with PKD. 

Here we describe the characterisation of the novel protein, termed 
Kidins220 (for Kinase D interacting substrate of 220 kDa), which binds to 
PKD and acts as a specific substrate. By using confocal microscopy as well 
as biochemical fractionation, we demonstrate that Kidins220 is a novel 
integral plasma membrane protein selectively expressed in brain and 
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neurocrine cells. Kidins220 segregates in lipid rafts and, during neurite 
formation, concentrates at the growth cones, both on the plasma membrane 
and on intracellular vesicular compartments. Kidins220 not only interacts 
with PKD, but is also the first protein that has been shown to be 

5 phosphorylated by PKD. Moreover, a peptide corresponding to the 
consensus PKD-phosphorylation motif, present within the Kidins220 
sequence, behaves as one of the best peptide substrates identified so far. 
Finally, by transient expression of PKD mutants, we show that PKD activity 
is essential for the stability of Kidins220 in vivo. Inhibition of PKD leads to 

10 the aggregation and degradation of Kidins220, whereas an increase of the 
PKD kinase activity promotes Kidins220 accumulation in the cell. These 
findings indicate that PKD is an essential modulator of Kidins220 
trafficking and turnover and suggest a striking new role for the PKD 
pathway in the regulation of the stability of its substrate(s). 

15 

Two sequences corresponding to different fragments of the human 
Kidins220 homologue have been deposited with GenBank (AB033076 and 
AL133620; Nagase et aL, 1999). However, these fragments are incomplete 
sequences. Also, there is no disclosure in these GenBank entries or in 

20 Nagase et aL, 1999 of the rat cDNA, the complete Kidins220 protein, or the 
function of the Kidins220 protein- 
Subsequent to the filing of our priority application, Kong et al (2001) J, 
NeuroscL 21:176-185 (incorporated herein by reference) have cloned a 

25 polynucleotide which encodes a polypeptide with sequence identity to the 
human and rat Kidins220 polypeptide. The polypeptide, designated ARMS 
(for ankyrin, repeat rich, and membrane spanning), is reported to be a 
downstream target of the neurotrophin and ephrin receptor tyrosine kinases 
Trk and Eph respectively, and is shown to physically associate with the 
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TrkA and p75 neurotrophin receptors. Although the Kjdins220/AJRMS 
polypeptide is also shown to be phosphorylated on a tyrosine residue in 
Kong et al (2001), there is no indication of which tyrosine residue, and no 
suggestion of which kinase is acting on the protein. On the basis of its 
5 integral membrane structure, Kong et al (2001) propose that the 
Kidisn220/AJRMS polypeptide may function as an ion channel. It is further 
suggested that the multiple protein interaction domains may indicate a 
function in recruiting proteins to Trk receptor tjarosine kinases. 

10 We have also shown in this work that seven additional forms of the rat 
Kidins220 protein exist. These forms appear to arise due to differential 
splicing of the transcript. 

Knowledge of the novel protein Kidins220, its cDNA and amino acid 
15 sequences, and as a physiological target of PKD, now allows the study of 
the role of PKD in the regulation of protein stability and provides a target 
mechanism for manipulation in the treatment of pathogenic processes, such 
as neurodegenerative diseases or cancer. 

20 A first aspect of the invention provides a polynucleotide comprising a 
nucleotide sequence which encodes the polypeptide whose amino acid 
sequence is given in Figure lb or Figure 1 1 , or a fragment or variant or 
fusion thereof or a fusion of said fragment or variant or a complement 
thereof, provided that the polynucleotide is not any one of the clones 

25 corresponding to the GenBank Accession Nos ALl 33620, AB033076, 
AC007175, F36H1 or any Accession No. listed in Figures 14a or 14b. 

The polynucleotide may be DNA or RNA but it is preferred if it is DNA. 
The polynucleotide may or may not contain introns. It is preferred if it does 
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not contain introns, and it is particularly preferred if the polynucleotide is a 
cDNA. 

Figure lb shows the amino acid sequence of the rat Kidins220 polypeptide. 
5 It is preferred if the polynucleotide encodes a polypeptide comprising the 
amino acid sequence given in Figure lb, or variants or fragments or fusions 
thereof, or fusions of said variants or fragments. It is particularly preferred 
if the polynucleotide encodes a polypeptide consisting of the amino acid 
sequence given in Figure lb. 

10 

Preferably the polynucleotide is not the partial cDNA clone identified in 
WO 01/42285 as SEQ ID No 30 or a fragment thereof consisting of 
nucleotides 1-285, or 813-930 or 1 145-3381 as numbered in SEQ ID No 30. 

15 Throughout the specification where the term Kidins220 is used, it is 
intended to include a polypeptide whose amino acid sequence is given in 
Figure lb or the human homologue of this sequence, unless the context 
suggests otherwise. The amino acid sequence of the human homologue 
(human Kidins220) is given in Figure 11. 

20 

A polynucleotide of the invention is not a polynucleotide which is any one 
of the DNA clones corresponding to the GenBank Accession Nos 
AL133620, AB033076, AC007175, F36H1 or any Accession No. listed in 
Figures 14a or 14b. Preferably the polynucleotide of the invention is not the 
25 partial cDNA clone identified in WO 01/42285 as SEQ ID No 30 or a 
fragment thereof consisting of nucleotides 1-285, or 813-930 or 1145-3381 
as numbered in SEQ ID No 30. GenBank Accession No AL133620 is a 
partial cDNA cloned within the vector pSportl. The cDNA sequence 
appears to encode a C-terminal portion of the human Kidins220 homologue. 
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GenBank Accession No AB033076 is a partial cDNA cloned into the 
pBluescriptll SK plus vector. The cDNA sequence appears to encode an N- 
terminal portion of the human Kidins220 homologue. GenBank Accession 
No AC007175 is a Drosophila chromosome 2R DNA sequence contained 
5 within a BAG clone. Accession No F36H1 is a cosmid containing C 
elegans genome sequence. Accession No AA998517 is an Expressed 
Sequence Tag (EST) cloned into the pT7T3D-'Pac vector. Accession No 
N83544 is an EST cloned into a Lambda ZAP Express vector and 
Accession No AA685317 is an EST cloned into a pBluescript SK- vector. 

0 

The partial cDNA clone identified as SEQ ID No 30 in WO 01/42285 is a 
human DNA sequence contained within the plasmid pINCY (Incyte 
Genomics, Palo Alto, CA). 



15 For the avoidance of doubt, however, the polynucleotides of the invention 
include polynucleotides which encode the amino sequences described in the 
GenBank Accession Nos listed in Figure 14b since these sequences are 
believed to be derived from the human Kidins220 protein but as noted 
above do not include the specific DNA clones corresponding to the given 

20 GenBank Accession Nos. It is particularly preferred if the polynucleotides 
which encode a polypeptide which contains tlie amino acid sequences 
described in the GenBank Accession Nos listed in Figure 14b encode the 
human Kidins220 polypeptide. It is particularly preferred if the 
polynucleotides of the invention contain the nucleotide sequence given in 

25 GenBank Accession Nos listed in Figure 14b. A sequence of human 
Kidins220 cDNA is shown in Figure 13. The sequence shows a greater than 
91% homology with the sequence encoding rat Kidins220 polypeptide 
shown in Figure lb and as such is within the scope of this invention. 
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In a preferred embodiment, the polynucleotide of the first aspect of the 
invention comprises a nucleic acid sequence given in Figure 7 or 17. It is 
particularly preferred if the polynucleotide comprises or consists of the 
sequence corresponding to the open reading frame (ORF) in Figure 7 or 17 
as this encodes the rat Kidins220 polypeptide. It is also particularly 
preferred if the polynucleotide comprises or consists of the sequence 
corresponding to the ORF in Figure 13 as this encodes the human 
Kidins220 polypeptide. 

Polynucleotides of the invention also include polynucleotides which encode 
any polypeptides encoded by the specific polynucleotides described such as 
the polynucleotides described in Figures 7, 17 and 13. 

In another embodiment of the first aspect of the invention the 
polynucleotide encodes a polypeptide whose amino acid sequence is as 
described in Figure 8. These polypeptides appear to be produced by splice 
variants of the Kidins220 mRNA/cDNA. 

In particular, amino acid residues between the arrows in Figure lb (residue 
numbers 1140 and 1232) are believed to constitute a "splicing cassette'* 
wherein the "cassette" shown in Figure lb is replaced in the variants with 
an alternative cassette whose amino acid sequence is chosen among those 
reported between the square brackets in Figure 8. 

It is understood that the polynucleotide which encodes the polypeptide 
whose amino acid sequence is given in Figure lb or Figure 11 (or a variant 
or fragment thereof) may vary in sequence due to degeneracy of the genetic 
code. For example, different codons can be substituted which code for the 
same amino acid(s) as the original codons. Alternatively, the substitute 
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codons may code for a different amino acid that will not affect the activity 
or immunogicity of the polypeptide or which may improve or otherwise 
modulate its activity or immtmogenicity. For example, site-directed 
mutagenesis or other techniques can be employed to create single or 
5 multiple mutations, such as replacements, insertions, deletions, and 
transpositions, as described in Botstein and Shortle, "Strategies and 
Applications of In Vitro Mutagenesis," jScienc^, 229: 193-210 (1985), which 
is incorporated herein by reference. Since such modified polynucleotides 
can be obtained by the application of known techniques to the teachings 
10 contained herein, such modified polynucleotides are within the scope of the 
claimed invention. 



Moreover, it will be recognised by those skilled in the art that the 
polynucleotide sequence, (or fragments thereof) of the invention can be 

15 used to obtain other polynucleotide sequences (which may be considered to 
be variants) that hybridise with it under conditions of high stringency. Such 
polynucleotides includes any genomic DNA. The polynucleotide of the 
invention includes polynucleotides which hybridise to the DNA whose 
sequence is given in Figure 7 or Figure 13 under stringent conditions, 

20 preferably at high stringency. Accordingly, the polynucleotide of the 
invention includes polynucleotides that show at least 55%, preferably at 
least 60%, and more preferably at least 65%, 70% or 75% or 80% or 85%, 
and most preferably at least 90% or 95% or 98% identity with flie 
polynucleotide whose nucleotide sequence is shown in Figure 7 or Figure 

25 13, provided that such homologous polynucleotide encodes a polypeptide 
whose amino acid sequence is given in Figure lb or Figure 1 1 or a variant 
or fragment or fusion thereof and is not any one of GenBank Accession Nos 
AL133620, AB033076, AC007175, F36Hlor any GenBank Accession No. 
listed in Figures 14a or 14b. A preferred polynucleotide variant is one 
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which has the nucleotide sequence shown in Figure 17, or one which 
encodes a polypeptide with tiie amino acid sequence shown in Figure 18. 



Per cent identity between nucleotide sequences can be determined by, for 
5 example, the Blast 2 sequence program (T.A. Tatusova and T.L. Madden 
FEMS Microbiol- Lett. (1999) 174:247-250) at the NCBI site 
(http://www.ncbi.nlm,nih.gov/gorfibl2.html) using as parameters the 
scoring matrix BLOSUM62, opening gap penalty —5, extending gap penalty 
-2. 

10 

It is believed that Kidins220 is found in mammals other than rat and 
humans. The present invention therefore includes poljniucleotides which 
encode Kidins220 from other mammalian species including human, mouse, 
cow, pig, sheep, rabbit and so on, but which polynucleotide is not any one 
15 of GenBank Accession Nos AL133620, AB033076, AC007175, F36H1 or 
any GenBank Accession No. listed in Figures 14a or 14b or the partial 
cDNA clone identified in WO 01/42285 as SEQ ID No 30 or fragments 
thereof consisting of nucleotides 1-285, 813-930 or 1145-3381 as numbered 
in SEQ ID No 30. 

20 

The polynucleotide of the invention has at least 10 nucleotides, and 
preferably at least 15 nucleotides and more preferably at least 30 
nucleotides. In a further preference, the polynucleotide is more than 50 
nucleotides, more preferably at least 100 nucleotides, and still more 
25 preferably the polynucleotide is at least 500 nucleotides. The 
polynucleotide may be more than Ikb, and may comprise more than 5kb. 

In one embodiment, the polynucleotide encodes a polypeptide whose amino 
acid sequence comprises the sequence CAASSESTGFGEERSIL or 
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RQMQRTITRQMSFDLTK or RQMQRTITRQMAFDLTK or a 
polypeptide which comprises the amino acid sequence (I/L)XRQM(S/A)J, 
where X is any amino acid, J represents a hydrophobic residue such as 
isoleucine, leucine, phenylalanine, tryptophan, tyrosine or valine, (I/L) 
5 represents isoleucine or leucine, and (S/A) represents serine or alanine. 
Preferably, the polynucleotide encodes a polypeptide which comprises the 
amino acid sequence (I/L)XRQM(S/A)J and fiirttier amino acids to fee N- 
terminus and C-terminus of this motif present in rat or human Kidins220, 
Typically the polynucleotide encodes a polypeptide of at least 10 amino 

10 acids, preferably at least 20 amino acids and more preferably at least 50 or 
100 amino acids. Such polynucleotides are useful for expressing large 
amounts of these polypeptides which are particularly useful in procedures 
such as raising antibodies or competition assays. As described below, the 
peptide CAASSESTGFGEERSIL is useful for raising antibodies which are 

15 selective for Kidins220, and the sequence RQMQRTITRQMSFDLTK 
contains a phosphorylation site for Protein Kinase D (underlined). 



It is further preferred if the polynucleotide comprises the ^ open reading 
frame given in Figure 7 or Figure 13 or Figure 17. Such a polynucleotide is 
20 useful for molecular manipulations where the full length Kidins220 cDNA 
is to be used, for example, in generating a DNA constmct suitable for 
expressing the full length Kidins220 polypeptide. 

The invention also includes a polynucleotide which is able to selectively 
25 hybridise to a polynucleotide which encodes the polypeptide whose amino 
acid sequence is given in Figure lb or Figure 1 1, or a fragment or variant or 
fusion thereof, or a fusion of said variant or fragment. A preferred variant 
to which the polynucleotide is able to selectively hybridise is that shown in 
Figure 17. Preferably, said polynucleotide is at least 10 nucleotides, more 
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preferably at least 15 nucleotides and still more preferably at least 30 
nucleotides in length. The said polynucleotide may be longer than 100 
nucleotides and may be longer than 200 nucleotides, but preferably the said 
polynucleotide is not longer than 250 nucleotides. Such polynucleotides are 
5 useful in procedures as a detection tool to demonstrate the presence of the 
polynucleotide in a sample. Such a sample may be a sample of DNA, such 
as a bacterial colony, fixed on a membrane or filter, or may be a sample 
from a patient, particularly a patient in whom cancer has been found or is 
suspected. 

10 

By "selectively hybridise" we mean that the polynucleotide hybridises 
under conditions of high stringency. DNA-DNA, DNA-RNA and RNA- 
RNA hybridisation may be performed in aqueous solution containing 
between O.IX SSC and 6X SSC and at temperatures of between 55^ C and 
15 70' C. It is well known in the art that the higher the temperature or the 
lower the SSC concentration the more stringent the hybridisation 
conditions. By "high stringency" we mean 2X SSC and 65"C. IX SSC is 
0.1 5M NaCl/0.015M sodium citrate. Polynucleotides which hybridise at 
high stringency are included within the scope of the claimed invention. 

20 

In another embodiment, the polynucleotide can be used as a primer in the 
polymerase chain reaction (PGR), and in this capacity a polynucleotide of 
between 15 and 30 nucleotides is preferred. A poljoiucleotide of between 
20 and 100 nucleotides is preferred when the fragment is to be used as a 
25 mutagenic PCR primer. It is particularly preferred if the PCR primer (when 
not being used to mutate a nucleic acid) contains about 15 to 30 contiguous 
nucleotides (ie perfect matches) from the nucleotide sequence given in 
Figure 7 or 17 or the human cDNA sequence shown in Figure 13. Clearly, 
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if the PGR primers are used for mutagenesis^ differences compared to the 
sequence will be present- 
Primers which are suitable for use in a polymerase chain reaction (PGR; 
5 Saiki et al (1988) Science 239, 487-491) are preferred. Suitable PGR 
primers may have the following properties: 

It is well known that the sequence at the 5' end of the oligonucleotide need 
not match the target sequence to be amplified. 

10 

It is usual that the PGR primers do not contain any complementary 
structures witib each other longer than 2 bases, especially at their 3' ends, as 
this feature may promote the formation of an artifactual product called 
"primer dimer". When the 3' ends of the two primers hybridize, they form 
15 a "primed template" complex, and primer extension results in a short duplex 
product called "primer dimer". 

Internal secondary stmcture should be avoided in primers. For symmetric 
PGR, a 40-60% G+G content is often recommended for both primers, with 

20 no long stretches of any one base. The classical melting temperature 
calculations used in conjunction with DNA probe hybridization studies 
often predict that a given primer should anneal at a specific temperature or 
that the 72°G extension temperature will dissociate the primer/template 
hybrid prematurely. In practice, the hybrids are more effective in the PGR 

25 process than generally predicted by simple Tm calculations. 

Optimum annealing temperatures may be determined empirically and may 
be higher than predicted. Taq DNA polymerase does have activity in the 
37-5 5°C region, so primer extension will occur during the annealing step 
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and the hybrid will be stabilized. The concentrations of the primers are 
equal in conventional (symmetric) PGR and, typically, within 0.1- to InM 
range. 



5 When a pair of suitable nucleic acids of the invention are used in a PGR it is 
convenient to detect the product by gel electrophoresis and ethidium 
bromide staining. As an alternative to detecting the product of DNA 
amplification using agarose gel electrophoresis and ethidium bromide 
staining of the DNA, it is convenient to use a labelled oligonucleotide 

10 capable of hybridising to the amplified DNA as a probe. When the 
amplification is by a PGR the oligonucleotide probe hybridises to the 
interprimer sequence as defined by the two primers. The probe may be 
labelled with a radionuclide such as ^^P, ^^P and ^^S using standard 
techniques, or may be labelled with a fluorescent dye. When the 

15 oligonucleotide probe is fluorescently labelled, the amplified DNA product 
may be detected in solution (see for example Balaguer et al (1991) 
"Quantification of DNA sequences obtained by polymerase chain reaction 
using a bioluminescence adsorbenf* AnaL Biochem, 195, 405-110 and 
Dilesare et al (1993) "A high-sensitivity electrochemiluminescence-based 

20 detection system for automated PGR product quantitation" BioTechniques 
15, 152-157. 

PGR products can also be detected using a probe which may have a 
fluorophore-quencher pair or may be attached to a solid support or may 
25 have a biotin tag or they may be detected using a combination of a capture 
probe and a detector probe. 
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Fluorophore-quencher pairs are particularly suited to quantitative 
measurements of PGR reactions (eg RT-PCR). Fluorescence polarisation 
using a suitable probe may also be used to detect PGR products, 

5 Oligonucleotide primers can be synthesised using methods well known in 
the art, for example using solid-phase phosphoramidite chemistry. 

As has been referred to above, and is discussed in more detail in the 
Examples, splice variants of ICidins220 exist. A preferred polynucleotide of 

10 the invention includes those polynucleotides which can distinguish the usual 
Kidins220 cDNA/mRNA (eg that encodes the Kidins220 amino acid 
sequence shown in Figure lb) from the splice variants (eg those whose 
amino acid differences are shown in Figure 8) or the different splice 
variants from each other. Typically, the polynucleotide in this embodiment 

15 is one which hybridises to a portion of nucleic acid which encodes a portion 
of the usual Kidins220 amino acid sequence but does not encode a portion 
of the variant (ie is specific for the usual Kidins220 cDNA/mRNA form). 
Alternatively, the polynucleotide may be one which hybridises to a portion 
of nucleic acid which encodes a portion of a specific splice variant 

20 Kidins220 amino acid sequence but does not encode a portion of the usual 
Kidins220 or of another splice variant (ie is specific for a particular splice 
variant). Suitable polynucleotides may readily be made by the skilled 
person by reference to Figures lb, 7, 8 and 17. 

25 It will also be appreciated that PGR primers may be made by reference to 
those Figures which can be used to distinguish the splice variants. For 
example, the PGR primers may be ones which hybridise to nucleotide 
sequences common to the usual Kidins220 mRNA/cDNA and the splice 
variants, and the presence or absence of the usual Kidins220 or particular 
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splice variants determined by reference to the size of DNA amplified. For 
example, PGR primer which hybridise to nucleic acid encoding portions of 
Kidins220 either side of the splice cassette referred to above may be used. 



5 Methods currently used for detection of Single Nucleotide Polymorphisms 
can also be used to distinguish splice variants, such as Invader Assay™ 
(Third Wave Technologies, Inc.), assays based on detection of primer 
extension such as SNP-IT™ (Orchid Biosciences, Inc.) and Rolling Circle 
Amplification (Lizardi et al (1998) ISlat Genet 19(3), 225-32). 

10 

A pol3aiucleotide of the first aspect of the invention may encode a variant of 
tiie polypeptide whose amino acid sequence is given in Figure lb or Figure 
11. A "variant" of the polypeptide includes natural variants, including 
allelic variants and naturally-occurring mutant forms, ''s^Mcg variants" 

15 (including those whose regions of variation in amino acid sequence are 
shown in Figure 8), and variants with insertions, deletions and substitutions, 
either conservative or non-conservative, where such changes do not 
substantially alter the activity of the said polypeptide. In addition, the 
insertions and/or deletions may lead to frameshifl mutations which may 

20 encode truncated (or elongated) polypeptide products, and insertions, 
deletions or other mutations may lead to tiie introduction of stop codons 
which encode truncate polypeptide products. A preferred variant of the 
polypeptide is one which comprises the sequence shown in Figure 18. 

25 By "conservative substitution" is intended combinations such as Gly, Ala; 
Val, He, Leu; Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. 

By "non-conservative substitution" we include other substitutions, such as 
those where the substituted residue mimics a particular modification of the 



BNSDCCID: <V/0 0220736A2 ! > 




wo 02/20786 PCT/GBO 1/03977 

17 

replaced residue, for example a phosphorylated serine may be replaced by 
aspartate or glutamate due to the similarity of the aspartate or glutamate side 
chain to a phosphorylated serine (ie they carry a negative charge at neutral 
PH). 

5 

Further non-conservative substitutions which are included in the term 
"variants" are point mutations which alter one, sometimes two, and usually 
no more than three amino acids. Such mutations are well known in the art 
of biochemistry and are usually designed to insert or remove a defined 
10 characteristic of the polypeptide. Examples of useful mutations in this 
invention are mutation of serine 919 (or its equivalent, such as serine 918 of 
Figure 18) to a non-phosphorylatable residue, such as alanine, to provide a 
form of the polypeptide which is not a target for Protein Kinase D but may 
still compete for the active site of the kinase. Another type of non- 
15 conservative mutation is the alteration or addition of a residue to a cysteine 
or lysine residue which can then be used with maleimide or succinimide 
cross-linking reagents to covalently conjugate the polypeptide to anottier 
moiety, Non-glycosylated proteins may be mutated to convert an 
asparagine to the recognition motif N-X-S/T for N-linked glycosylation. 
20 Such a modification may be useful to create a tag for purification of the 
polypeptide using Concanavalin A-linked beads. 

Other useful mutations include the replacement of one or more lysine 
residues in the motifs SKLLPGKKSS (residues 1481-1490) and 
25 SICSEDKKS (residues 1637-1645) which are putative ubiquitinylation 
sites. Other useful mutations also include the replacement of one or more 
residues (especially lysine) in the motif AQWGSGKS (residues 467-474) 
which is a putative ATP binding motif and the replacement of one or more 
tyrosine residues in the motifs KAGETPY (residues 403-409) and 
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KRGDVIDY (residues 1446-1453) which are putative tyrosine 
phosphorylation sites. 

Such variants may be made using the methods of protein engineering and 
5 site-directed mutagenesis well known in the art. 

Variants of the Kidins220 polypeptide include polypeptides with at least 
65% identity to the amino acid sequence given in Figure lb or Figure 11, 
preferably at least 70% or 80% or 85% or 90% identity to said sequence, 

10 and more preferably at least 95% or 98% identity to said amino acid 
sequence. A preferred variant is one with the sequence shown in Figure 18- 
Percent identity can be determined by, for example, the LALIGN program 
(Huang and Miller, Adv. Appl. Math. (1991)12:337-357) at the Expasy 

15 facility site (http://www.ch.embnet.org/software/LALIGN form,html) using 
as parameters the global aligimient option, scoring matrix BLOSUM62, 
opening gap penalty -14, extending gap penalty -4. Comparison between 
rat and C.elegans Kidins220 is shown in Figure 10. 

20 Preferably, the polynucleotide encodes a Kidins220 that has at least 30%, 
preferably at least 50% and more preferably at least 70% of the activity of a 
natural Kidins220, under the same assay conditions. 

Kidins220 may be assessed in bioassays based on its function as a 
25 physiological substrate of Protein Kinase D and phosphorylation by this 
kinase. Kidins220 may also be assessed in bioassays based on its ability to 
bind ATP or analogues thereof Kidins220 may also be assayed using 
immunological techniques such as ELISA and tiie like. Further functions of 
Kidins220 are as a substrate for a tyrosine kinase, preferably a tyrosine 
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kinase acting downstream of neurotrophin and ephrin receptor tyrosine 
kinases, and as a protein capable of associating with TrkA and p75 
neurotrophin receptors. Such tyrosine phosphorylation and protein 
association is described in Kong et al (2001) J. Netiroscience, 21:176-185 
5 (incorporated herein by reference). 

Further activities resulting from function of Kidins220 which may be 
assessed are cell differentiation, neurite elongation and growth cone 
progression (Lamoureux et al (1997) J. Cell Science 110:635-4; Isbister and 

10 O'Connor (1999) J. NeuroscL 19:2589-600; Fritsche et al (1999) Mol Cell 
Neurosci. 14:389-418). The polymerisation rates of cytoskeletal 
components such as actin or microtubules or microtubule components may 
be affected by the presence or form of Kidins220, and as such may also be 
activities suitable for reporting Kidins220 function. The interaction of 

15 Kidins220 with the polymerised actin or microtubule components may 
present a further activity useful for assessing Kidins220 function. 

Due to the accumulation of Kidins220 in the area adjacent to the growth 
cone in a vesicular compartment, Kidins 220 is believed to play a role in the 

20 Opposition of new membrane during neurite elongation and/or neuronal 
remodelling. Thus, Kidins220 is believed to have a role in CNS and PNS 
development. Experiments suggest that Kidins220 is present in primary 
motomeurone from spinal cord with a distribution similar to that observed 
in differentiated PC 12. A failure in Kidins220 might determine failure in 

25 neuronal differentiation or in its maintenance. A neurodegenerative profile 
is therefore a possible pathology associated with the impairment of 
Kidins220 function. 
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By "fragment of the polypeptide whose amino acid sequence is given in 
Figure lb" or Figure 1 1 we include any fragment which retains activity of 
the full length polypeptide or which is useful in some other way, for 
example, for use in raising antibodies or in a binding assay, but which is not 
5 the polypeptide expressed in Nagase et al (1 999) DNA Res. 6; 337-345. 

As discussed below, useful fragments also include those that contain the 
phosphorylatable serine (serine 919). Typically, the fragments are at least 
five consecutive residues, preferably at least 10 consecutive residues and 
10 may be 15, 25, 50, 100, 150 or more consecutive residues. 

By "fusion of the polypeptide whose amino acid sequence is given in Figure 
lb or Figure 11" we include said Kidins220 polypeptide, fused to any other 
polypeptide. For example, the said Kidins220 may be fused to a 

15 polypeptide such as glutathione- S-transferase (GST) or protein A in order to 
facilitate purification of Kidins220, or it may be fused to some other 
polypeptide which imparts some desirable characteristics on the Kidins220 
polypeptide, for example a fluorescent protein such as a green fluorescent 
protein (GFP). Fusions to any variant (for example, the sequence shown in 

20 Figure 1 8) or fragment are included in the scope of the invention. 

Conveniently, a polynucleotide of the invention further comprises a 
detectable label. By "detectable label" is included any convenient 
radioactive label such as ^^P, ^^P or ^^S which can readily be incorporated 
25 into a nucleic acid molecule using well known methods; any convenient 
fluorescent or chemiluminescent label which can readily be incorporated 
into a nucleic acid is also included. In addition the term "detectable label" 
also includes a moiety which can be detected by virtue of binding to another 
moiety (such as biotin which can be detected by binding to streptavidin); 
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and a mm^^ yipne of its 

ability to convert a colourless compomd into a coloured compound, or vice 
versa (for example, alkaline phosphatase can convert colourless o- 
nitrophenylphosphate into coloured o-nitrophenol). The detectable label 
may also be a fluorophore-quencher pair as described in Tyagi & Kramer 
(1996) Nature Biotechnology 14, 303-308. 

A second aspect of the invention provides an expression vector comprising 
a polynucleotide according to the first aspect of the invention. 

A variety of methods have been developed to operably link polynucleotides, 
especially DNA, to vectors, for example, via complementary cohesive 
termini. For instance, complementary homopolymer tracts can be added to 
the DNA segment to be inserted into the vector DNA. The vector and DNA 
segment are then joined by hydrogen bonding between the complementaiy 
homopolymeric tails to form recombinant DNA molecules. 

Synthetic linkers containing one or more restriction sites provide an 
alternative method of joining the DNA segment to vectors. The DNA 
segment, generated by endonuclease restriction digestion as described 
earlier, is treated with bacteriophage T4 DNA polymerase or E.coli DNA 
polymerase I, en2ymes that remove protruding, 3 '-single-stranded termini 
with their 3'-5'-exonucleolytic activities, and fill in recessed 3 '-ends with 
their polymerising activities. 

The combination of these activities therefore generates blunt-ended DNA 
segments. The blunt-ended segments are then incubated with a larger molar 
excess of linker molecules in the presence of an enzyme that is able to 
catalyse the ligation of blunt-eiided DNA molecules, such as bacteriophage 
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T4 DNA ligase. Thus, the products of the reaction are DNA segments 
carrying polymeric linker sequences at their ends. These DNA segments 
are then cleaved with the appropriate restriction enzyme and ligated to an 
expression vector that has been cleaved with an enzyme that produces 
5 termini compatible with those of the DNA segment. 

Synthetic linkers containing a variety of restriction endonuclease site are 
commercially available from a number of sources including International 
Biotechnologies Inc., New Haven, CN, USA. 

10 

A desirable way to modify tiie DNA encoding the polypeptide of the 
invention is to use PGR. This method may be used for introducing the 
DNA into a suitable vector, for example by engineering in suitable 
restriction sites, or it may be used to modify the DNA in other useful wasy 
15 as is known in the art. 

In this method the DNA to be enzymatically amplified is flanked by two 
specific primers which themselves become incorporated into the amplified 
DNA. The said specific primers may contain restriction endonuclease 
20 recognition sites which can be used for cloning into expression vectors 
using methods known in the art. 

The DNA (or in the case of retroviral vectors, RNA) is then expressed in a 
suitable host to produce a polypeptide comprising the polypeptide of the 
25 invention. Thus, the DNA encoding the polypeptide constituting the 
polypeptide of the invention may be used in accordance with known 
techniques, appropriately modified in view of the teachings contained 
herein, to construct an expression vector, which is then used to transform an 
appropriate host cell for the expression and production of the polypeptide of 
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the invention. Such techniques include those disclosed in US Patent Nos. 
4,440,859 issued 3 April 1984 to Rutter et al, 4,530,901 issued 23 July 1985 
to Weissman, 4,582,800 issued 15 April 1986 to Crowl, 4,677,063 issued 30 
June 1987 to Mark et al, 4,678,751 issued 7 July 1987 to Goeddel, 
5 4,704,362 issued 3 November 1987 to Itakura et al, 4,710,463 issued 1 
December 1987 to Murray, 4,757,006 issued 12 July 1988 to Toole, Jr. et 
al, 4,766,075 issued 23 August 1988 to Goeddel et al and 4,810,648 issued 
7 March 1989 to Stalker, all of which are incorporated herein by reference. 

10 The DNA (or in the case or retroviral vectors, RNA) encoding the 
polypeptide constituting the polypeptide of the invention may be joined to a 
wide variety of other DNA sequences for introduction into an appropriate 
host. The companion DNA will depend upon the nature of the host, the 
manner of the introduction of the DNA into the host, and whether episomal 

15 maintenance or integration is desired. 

Generally, the DNA is inserted into an expression vector, such as a plasmid, 
in proper orientation and correct reading frame for expression;^ If necessary, 
the DNA may be linked to the appropriate transcriptional and translational 

20 regulatory control nucleotide sequences recognised by the desired host, 
although such controls are generally available in the expression vector. The 
vector is then introduced into the host through standard techniques. 
Generally, not all of the hosts will be transformed by the vector. Therefore, 
it will be necessary to select for transformed host cells. One selection 

25 technique involves incorporating into the expression vector a DNA 
sequence, with any necessary control elements, that codes for a selectable 
trait in the transformed cell, such as antibiotic resistance. Alternatively, the 
gene for such selectable trait can be on another vector, which is used to co- 
transform the desired host cell. 
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Host cells that have been transformed by the expression vector of the 
invention are then cultured for a sufficient time and under appropriate 
conditions known to those skilled in the art in view of the teachings 
5 disclosed herein to permit the expression of the polypeptide, which can then 
be recovered. 

Many expression systems are known, including bacteria (for example, 
E.coli and Bacillus subtilis), yeasts (for example Saccharomyces 
10 cerevisiae)^ filamentous fungi (for example Aspergillus) y plant cells, animal 
cells and insect cells. 

The vectors typically include a prokaryotic replicon, such as the ColEl arz, 
for propagation in a prokaryote, even if the vector is to be used for 
15 expression in other, non-prokaryotic, cell types. The vectors can also 
include an appropriate promoter such as a prokaryotic promoter capable of 
directing the expression (transcription and translation) of the genes in a 
bacterial host cell, such as E.coli^ transformed therewith. 

20 A promoter is an expression control element formed by a DNA sequence 
that permits binding of RNA polymerase and transcription to occur. 
Promoter sequences compatible with exemplary bacterial hosts are typically 
provided in plasmid vectors containing convenient restriction sites for 
insertion of a DNA segment of the present invention, 

25 

Typical prokaryotic vector plasmids are pUC18, pUC19, pBR322 and 
pBR329 available from Biorad Laboratories, (Richmond, CA, USA) and 
pTrc99A and pKK223-3 available firom Pharmacia, Piscataway, NJ, USA. 



BNSDOCID: <WO 



C?207S6A2 




\V0 02/20786 PCT/GBOl/03977 

25 

A typical mammalian cell vector plasmid is pSVL available from 
Phamiacia, Piscataway, NJ, USA. This vector uses the SV40 late promoter 
to drive expression of cloned genes, the highest level of expression being 
found in T antigen-producing cells, such as COS-1 cells. 

5 

An example of an inducible mammalian expression vector is pMSG, also 
available from Pharmacia. This vector uses the glucocorticoid-inducible 
promoter of the mouse mammary tumour vims long teraiinal repeat to drive 
expression of the cloned gene. 

10 

Useful yeast plasmid vectors are pRS403-406 and pRS413-416 and are 
generally available from Stratagene Cloning Systems, La Jolla, CA 92037, 
USA. Plasmids pRS403, pRS404, pRS405 and pRS406 are Yeast 
Integrating plasmids (Yips) and incorporate the yeast selectable markers 
15 HIS3, TRPl, LEU2 and URA3. Plasmids pRS413-416 are Yeast 
Centromere plasmids (Ycps). 

Other vectors and expression systems are well known in the art for use with 
a variety of host cells. 

20 

In one embodiment of the second aspect of the invention, the expression 
vector is one suitable for use in gene therapy. 

A third aspect of the invention provides a polynucleotide comprising the 
25 sequence shown in Figure 16 or a fragment or variant thereof provided that 
it is not the clone corresponding to GenBank Accession No AGO 12495. 

In a preferred embodiment, the polynucleotide or fragment or variant of 
polynucleotide is capable of promoting transcription, and in an even more 
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preferred embodiment, the transcription is promoted in a neuronal cell- 
selective way. 

By "promote transcription" we include the meaning that the polynucleotide 
5 is one which when operably liidced to another polynucleotide allows for the 
transcription of that other polynucleotide in a suitable cell, such as a 
mammalian cell. 

By "neuronal cell-selective" we include the meaning that the polynucleotide 
10 is able to promote transcription in neuronal cells (typically mammalian 
neuronal cells) but is less able or is substantially incapable of promoting 
transcription in non-neuronal cells. Conveniently, the polynucleotide can 
promote transcription at least 5 times more efficiently (using a standard 
transcription assay such as chloramphenicol acetyl transferase expression) 
15 in neuronal cells than non-neuronal cells. Preferably, the polynucleotide is 
at least 10 times or, more preferably 50 times, more efficient in neuronal 
cells than non-neuronal cells. 

In one embodiment, the polynucleotide comprises at least 10 consecutive 
20 nucleotides located between positions 34609 and 36632 as shown in Figure 
16. Preferably, the polynucleotide comprises at least 15, 20, 25, 30, 35, 40 
or 50 or 100 or 200 or 500 consecutive nucleotides firom between positions 
34609 and 36632 as shown in Figure 16. 

25 It is preferred if the polynucleotide comprises a polynucleotide encoding a 
polypeptide. Conveniently, the polynucleotide of this aspect of the 
invention and the polynucleotide expressing the polypeptide are operably 
linked. The polypeptide encoded may be any suitable polypeptide, but it is 
preferred if the polypeptide is one with therapeutic capabilities. The nature 
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of the therapeutic capabihty required depends on the circumstances, but 
examples may include nerve growth factors such as Nerve Growth Factor 
(NGF), Brain Derived Neurotrophic Factor (BDNF), polypeptides useful in 
treatment of neurodegenerative disease and the like. In an especially 
5 preferred embodiment, the polypeptide is the polypeptide whose amino acid 
sequence is given in Figure lb or Figure 11, or a fragment or variant or 
fusion of said fragment or variant. A preferred variant of the polypeptide is 
one which has the sequence shovm in Figure 18. 



10 In an alternative preferred embodiment, the polypeptide is Protein IGnase 
D, or a mutant thereof such as are discussed below. 

The polypeptide may also be any polypeptide which is detectable. Such 
polypeptides are useful in imaging cells which are able to use the 
15 polynucleotide whose nucleotide sequence is given in Figure 16 to promote 
transcription, or as markers by which to specifically target other reagents to 
the expressing cells . 

In a further embodiment of this aspect of the invention, the polynucleotide 
20 is one able to replicate within a cell. 

A fourth aspect of the present invention provides a host cell transformed 
with a pol3aiucleotide vector construct of the present invention. The host 
cell can be either prokaryotic or eukaryotic. Bacterial cells are preferred 
25 prokaryotic host cells and t)^ically are a strain of E,coli such as, for 
example, the E,coli strains DH5 available from Bethesda Research 
Laboratories Inc., Bethesda, MD, USA, and RRl available from the 
American Type Culture Collection (ATCC) of Rockville, MD, USA (No. 
ATCC 31343). Preferred eukaryotic host cells include yeast, insect and 
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maiximalian cells, preferably vertebrate cells such as those from a mouse, 
rat, monkey or human fibroblastic and kidney cell lines. Yeast host cells 
include YPH499, YPH500 and YPH501 which are generally available from 
Stratagene Clonmg Systems, La JoUa, CA 92037, USA. Preferred 
5 mammalian host cells include Chinese hamster ovary (CHO) cells available 
from the ATCC as CRL 1658 and 293 cells which are human embryonic 
kidney cells. Preferred insect cells are Sf9 cells which can be transfected 
with baculovirus expression vectors. 

10 Transformation of appropriate cell hosts with a DNA construct of the 
present invention is accomplished by well known methods that typically 
depend on the type of vector used. With regard to transformation of 
prokaryotic host cells, see, for example, Cohen et al (1972) Proc. NatL 
Acad. Sci. USA 69, 2110 and Sambrook et al (1989) Molecular Cloning, A 

15 Labor atoiy Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY. Transformation of yeast cells is described in Sherman et al (1986) 
Methods In Yeast Genetics, A Laboratory Manual^ Cold Spring Harbor, NY. 
The method of Beggs (1978) Nature 275, 104-109 is also useftil. With 
regard to vertebrate cells, reagents useful in transfecting such cells, for 

20 example calcium phosphate and DEAE-dextran or liposome formulations, 
are available from Stratagene Cloning Systems, or Life Technologies Inc., 
Gaithersburg, MD 20877, USA. 

Electroporation is also useful for transforming and/or transfecting cells and 
25 is well known in the art for transforming yeast cells, bacterial cells, insect 
cells and vertebrate cells. 

For example, many bacterial species may be transformed by the methods 
described in Luchansky et al (1988) Mol. Microbiol. 2, 637-646 
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incorporated herein by reference. The greatest number of transformants is 
consistently recovered following electroporation of the DNA-cell mixture 
suspended in 2.5 PEB using 6250V per cm at 25 jxFD. 

Methods for transformation of yeast by electroporation are disclosed in 
Becker & Guarente (1990) Methods Enzymol. 194, 182. 

Successftilly transformed cells, ie cells that contain a DNA construct of the 
present invention, can be identified by well known techniques. For 
example, cells resulting fi:om the introduction of an expression construct of 
the present invention can be grown to produce the polypeptide of the 
invention. Cells can be harvested and lysed and their DNA content 
examined for the presence of the DNA using a meftod such as that 
described by Southern (1975) J. Mol. Biol. 98, 503 or Berent et al (1985) 
Biotech. 3, 208. Alternatively, the presence of the protein in the supernatant 
can be detected using antibodies as described below. 

In addition to directly assaying for the presence of recombinant DNA, 
successful transformation can be confirmed by well known immunological 
methods when the recombinant DNA is capable of directing the expression 
of the protein. For example, cells successfully transformed with an 
expression vector produce proteins displaying appropriate antigenicity. 

Samples of cells suspected of being transformed are harvested and assayed 
for flie protein using suitable antibodies. 

The host cell may be a host cell within an animal body. Thus, transgenic 
animals which express a polypeptide of the fourth aspect of the invention by 
virtue of the presence of the transgene are included. Preferably, the 
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transgenic animal is a rodent such as a mouse. Transgenic animals can be 
made using methods well known in the art. 

In one embodiment of the fourth aspect of the invention, the cells are 
5 transformed or transfected with a polynucleotide according to the first 
aspect of the invention or an expression vector according to the second 
aspect of the invention, and said cells further comprise a polynucleotide 
capable of expressing Protein Kinase D or a functional equivalent thereof. 
As discussed in more detail below, such cells may be useful for 
10 investigating the interaction of Protein Kinase D and Kidsin220. 
Alternatively, cells capable of co-expressing Kidins220 and Protein Kinase 
D may be useftil for investigating or producing Kidins220 in its 
phosphorylated form. 

15 It will be appreciated that the Protein ICinase D expressed in said cells may 
be the form found in Nature, or may be a mutant form. Examples of useful 
mutant forms are constitutively active forms generated, for example, by a 
double mutation of residues serine 744 and serine 749 to glutamic acid, and 
a dominant negative form produced by mutation of aspartic acid 733 to 

20 alanine. 

Functional equivalents include these mutants and other forms of PKD (such 
as variants and fusions) that are capable of phosphorylating Kidins220. 

25 Other mutants of PKD which may be also useful in the invention are 
described in Iglesias et al (1998) FEES Lett, 437:19-23; Iglesias and 
Rozengurt (1998) J. Biol Chem, 273:410-416; Iglesias et al (1998) J. Biol 
Chem. 273:27662-27667 all incorporated herein by reference and which 
refer to PKD cDNA and its sequence. 
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In addition to the transformed host cells themselves, the present invention 
also contemplates a culture of those cells, preferably a monoclonal (clonally 
homogenous) culture, or a culture derived from a monoclonal culture, in a 
5 nutrient medium. 

A fifth aspect of the invention provide a polypeptide comprising the amino 
acid sequence given in Figure lb or Figure 1 1 or a fragment or variant or 
fusion thereof, or a fusion of said fragment or variant provided that said 
10 polypeptide is not the polypeptide encoded by GenBank Accession No. 
AB033076. 

Preferably, the polypeptide of the invention is not the polypeptide of 
WO 01/42285 identified as SEQ ID No 9. 

15 

Thus, a polypeptide of the invention includes a polypeptide whose amino 
acid sequence is given in Figure lb or a polypeptide whose sequence is 
given in Figure 11. A polypeptide of the invention which is a variant of the 
amino acid sequence given in Figure lb or Figure 1 1 includes that whose 
20 sequence is shown in Figure 18, 

A polypeptide of the invention includes a polypeptide whose amino acid 
sequence is given in Figure lb but wherein the "splicing cassette" (ie 
between residues 1140 and 1232 as indicated by the arrows in Figure lb) is 
25 replaced by any one of the "cassettes" indicated in Figure 8 between the 
square brackets. 

Preferably, a polypeptide of the invention is one which has at least 65% 
identity with the amino acid sequence given in Figure lb or Figure 1 1 or 
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which sequence is as given in Figure 8. More preferably, the polypeptide 
has at least 70% or 80%, 85% or 90% identity, and even more preferably 
the polypeptide has at least 95% or 98% identity with the amino acid 
sequence shown in Figure lb. 

5 

It will be appreciated that a polypeptide of the invention is not the 
polypeptide encoded by GeiiBank Accession No. AB033076, but a 
polypeptide of the invention may include the amino acid sequence encoded 
by GenBank Accession No. AB033076. Preferably, the polypeptide is not 
10 that of WO 01/42285 identified as SEQ ID No 9. 

Therefore a polypeptide of the invention includes homologues of Kidins220 
which are found in species other than rat, such as human, mouse, sheep and 
which are phosphorylation substrates of Protein Kinase D. It also includes 
15 variants of these that retain at least some activity of Kidins220. 

As discussed above, the polypeptide of the invention includes human 
Kidins220. 

20 Typically, the polypeptide of the invention is one which is able to interact 
with Protein Kinase D, and be phosphorylated on serine 919 (as numbered 
in Figure lb) or serine 918 (as numbered in Figure 18). Preferably, the 
polypeptide of the invention is able to bind ATP. 

25 It will be appreciated that the polypeptide of the invention may, or may not 
be phosphorylated. Where the polypeptide is phosphorylated, it may be on 
more than one residue. In particular, the polypeptide may, or may not be 
phosphorylated on serine 919 as numbered in Figure lb or an equivalent 
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10 



15 



20 



position. Both phosphorylated and non-phosphorylated forms of tiie 
polypeptide are included within the scope of the invention. 

By "equivalent position" we mean an amino acid residue in a polypeptide, 
typically of the same type, which occupies a position which is the same as 
the given residue (in this case Ser 919) when the sequence of the 
polypeptide is optimally aligned, using the computer programs discussed 
above, with the reference sequence (in this case the Figure lb sequence). 
Equivalent position, in this context, is well understood by the person skilled 
in the art. In the sequence of Figure 18, the equivalent serine is serine 918. 

It will also be appreciated that the polypeptide of the invention may or may 
not have ATP or an analogue or functional equivalent thereof bound to it. 
Both ATP-bound and non ATP-bound forms are included within the scope 
of the invention. 

"Fragments" of the Kidins220 polypeptide include polypeptides which 
comprise at least five consecutive amino acids of the polypeptide of the 
invention. Preferably, a fragment of the polypeptide comprises an amino 
acid sequence which is useful, for example, a fragment which retains 
activity of the polypeptide, or is useful as a peptide for producing an 
antibody which is specific for the Kidins220 polypeptide. Typically, the 
fragments have at least 8 consecutive amino acids, preferably at least 10, 
more preferably at least 12 or 15 or 20 or 30 or 40 or 50 consecutive amino 
acids of a polypeptide of the invention such as that shown in Figure lb or 
the human amino acid sequence shown in Figure 11. 

Particularly useful polypeptide fragments are those which comprise the 
Protein Kinase D phosphorylation recognition motif. The Protein Kinase D 
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phosphorylation recognition motif may be considered to include the motif 
(I/L)XRQMSJ, where X is any amino acid, J represents a hydrophobic 
residue such as isoleucine, leucine, phenylalanine, tryptophan, tyrosine or 
valine and (I/L) represents isoleucine or leucine. An example of such a 
5 polypeptide fragment includes the sequence RQMQRTITRQMSFDLTK. 

It will be appreciated that the serine in the sequence 
RQMQRTITRQMSFDLTK (underlined; residue 919 as numbered in Figure 
lb) which can be phosphorylated by Protein Kinase D, may be usefully 

10 changed to an amino acid residue other than serine. For example, serine 
919 may be mutated to prevent phosphorylation of the polypeptide at this 
residue by Protein Kinase D. An example of such a mutation is 
RQMQRTITRQMAFDLTK. A conservative mutation of serine 919 may 
be to a threonine residue. Polypeptides which may mimic phosphorylated 

15 serine 919 are those in which serine 919 has been changed to an aspartate or 
glutamate residue. Polypeptides or fragments thereof containing such 
mutations are useful in competition assays, or as inhibitors of Protein 
Kinase D phosphorylating activity, or as epitopes for raising antibodies. 

20 Typically, a useful fragment of the polypeptide of the invention is one 
which is antigenic. An example of an antigenic polypeptide fragment of the 
invention is CAASSESTGFGEERSIL, Such a fragment can be used to 
raise antibodies which are selective for the Kidins220 polypeptide, as 
described in more detail below. 

25 

In one embodiment of this aspect of the invention, the polypeptide fragment 
contains an amino acid sequence which is present in all of the splice variant 
forms of Kidins220. In other words, the fragment is a fragment of 
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Ridms220 outside of the splicing cassette shown in Figure lb and therefore 
common to the different Kidins220 forais. 

In another embodiment of this aspect of the invention, the polypeptide 
5 fragment contains an amino acid sequence which is present in the splicing 
cassette, ie the amino acid sequence between the arrows in Figure lb or the 
amino acid sequences within the square brackets shown in Figure 8, 
Suitable fragments which are included in this embodiment are fragments 
including the amino acid sequence HPFYNRANIN (clone 1), 
10 HPFYNRAAVP (clone 2), DQNNGLAAVP (clone 3), RQVQKLQAAV 
(clone 4), CHRQLTVTEF (clone 5) and TEDAAEGLPS (clone 6). 

It will be appreciated that fragments of the polypeptide of the invention are 
particularly useful when fused to other polypeptides, such as GST, Green 

15 Fluorescent Protein (GFP), Vesicular Stomatitis Virus Glycoprotein 
(VSVG) or Keyhole Limpit Haemacyanin (KLH). For example, a fusion of 
the fragment CAASSESTGFGEERSIL to GST may be used in methods 
well known in the art of antibody purification to purify- a polyclonal 
antibody. Fusions of the polypeptide, or fusions of fragments or variants of 

20 the polypeptide of the invention are included in the scope of the invention. 

Hence, a "fusion" of the Kidins220 pol3^eptide or a fragment or variant 
thereof provides a molecule comprising a polypeptide of the invention and a 
further portion. It is preferred that the said further portion confers a 
25 desirable feature on the said molecule; for example, the portion may useful 
in detecting or isolating the molecule, or promoting cellular uptake of the 
molecule. The portion may be, for example, a biotin moiety, a radioactive 
moiety, a fluorescent moiety, for example a small fluorophore or a green 
fluorescent protein (GFP) fluorophore, as well known to those skilled in the 
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art. The moiety may be an immunogenic tag, for example a Myc tag, as 
known to those skilled in the art or may be a lipophilic molecule or 
polypeptide domain that is capable of promoting cellular uptake of the 
molecule or the interacting polypeptide, as known to those skilled in the art. 

5 

A sixth aspect of the invention provides a peptide comprising the sequence 
(I/L)XRQM(S/A)J wherein X is any amino acid, J is a hydrophobic amino 
acid such as isoleucine, leucine, phenylalanine, tryptophan, tyrosine or 
valine, (I/L) represents isoleucine or leucine and (S/A) represents serine or 
10 alanine and which is not the polypeptide encoded by the nucleotide 
sequence in GenBank Accession No AB033076 and which is not \i peptide 
(AALVRQMSVAFFFK). 

Preferably J is V or F. 

15 

When a serine is present rather than alanine at position (S/A), the peptide is 
believed to be a substrate for Protein Kinase D. 

Preferably, the polypeptide comprises the amino acid sequence 
20 (I/L)XRQM(S/A)J and further amino acids to the N-terminus and C- 
terminus of this motif present in rat or human Kidins220 (see Figure lb and 
Figure 1 1 and Figure 18). Typically, the peptide is at least 8 amino acids in 
length, preferably at least 10 amino acids but may be 15 or 20 or 25 or 30 or 
35 or 40 amino acids in length. 

25 

Preferred peptides include the sequence RQMQRTITRQMSFDLTK, or a 
minimum sequence thereof which can be phosphorylated by PKD may be 
useful as a substrate for Protein Kinase D since the serine residue in this 
sequence is phosphorylated in Kidins220 by Protein Kinase D. 
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Further preferred peptides include the sequence 
RQMQRTITRQMAFDLTK, or a minimum sequence which can be 
phosphorylated by PKD where S has been changed to A, may be useful as a 
5 non-phosphorylatable control for Protein ICinase D, compared to the above 
polypeptides, since the phosphorylatable serine residue within the motif has 
been changed to alanine which cannot be phosphorylated. 

A seventh aspect of the invention provides a peptide comprising the amino 
10 acid sequence CAASSESTGFGEERSIL or a variant or fragment or fusion 
thereof or a fusion of said fragment or variant or an analogue thereof 
provided that it is not the polypeptide encoded by the nucleotide sequence 
in GenBank Accession No AB033076. 

15 Preferably the peptide is not a peptide of the polypeptide identified as SEQ 
ID No 9 in WO 01/42285. 

The length preferences for these peptides is as discussed previously, 

20 An eighth aspect of the invention provides a method of producing a peptide 
or polypeptide according to the fifth, sixth or seventh aspects of the 
invention comprising expressing a polynucleotide according to the second 
aspect of the invention. 

25 hi a preferred embodiment of the eighth aspect of the invention, the method 
of producing a peptide or polypeptide according to the fifth aspect of the 
invention comprises culturing a host cell according to the fourth aspect of 
the invention. 



BNSOCXJiD: <WO 02207B6A2 I > 





wo 02/20786 



PCT/GBOl/03977' 



38 



Methods of cultivating host cells and isolating recombinant proteins are 
well known in the art. It will be appreciated that, depending on the host 
cell, the Kidins220 produced may differ from that which can be isolated 
from nature. For example, certain host cells, such as yeast or bacterial 

5 cells, either do not have, or have different, post-translational modification 
systems which may result in the production of forms of Kidins220 which 
may be post-translationally modified in a different way to Kidins220 
isolated from nature. In order to obtain Kidins220 which is post- 
translationally modified in a different way to human Kidins220 or rat 

10 Kidins220 it is preferred if the host cell is a non-human or non-rat host cell; 
more preferably it is not a mammalian cell. 

It is preferred that the Kidins220 polypeptide is produced in a eukaryotic 
system, such as an insect cell. 



According to a less preferred embodiment, the Kidins220 polypeptide can 
be produced in vitro using a commercially available in vitro translation 
system, such as rabbit reticulocyte lysate or wheatgerm lysate (available 
from Promega). Preferably, the translation system is rabbit reticulocyte 

20 lysate. Conveniently, the translation system may be coupled to a 
transcription system, such as the TNT transcription-translation system 
(Promega), This system has the advantage of producing suitable mRNA 
transcript from an encoding DNA polynucleotide in the same reaction as the 
translation. Conveniently, where the expressed polypeptide comprises one 

25 or more transmembrane domains, the translation system can be 
supplemented with a source of endoplasmic reticulum-derived membranes 
and folding chaperones, such as dog pancreatic microsomes, to allow 
synthesis of the polypeptide in a native conformation. 



15 
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Preferably, the production method of the eighth aspect of the invention 
comprises a further step of isolating the Kidins220 produced jfrom the host 
cell or from the in vitro translation mix. Preferably, the isolation employs 
an antibody which specifically binds the expressed polypeptide of the 
5 invention. 

It will be understood that the invention comprises the Kidins220 
polypeptide or the variant or fragment or fusion thereof, or a fusion of said 
variant or fragment obtainable by the methods herein disclosed. 

10 

A ninth aspect of the invention provides an antibody which specifically 
binds to a polypeptide whose amino acid sequence is given in Figure lb or 
Figure 1 1, or a fragment or variant of the said polypeptide, for example, the 
polypeptide whose amino acid sequence is given in Figure 18. The 
15 antibodies include antibodies which are selective for Kidins220 and 
antibodies which are selective for the individual splice variant fomis of 
Kidins220 as described. Thus, the antibodies of the invention include those 
that specifically bind to human Kidins220 or individual splice variant forms 
thereof. 

20 

Preferably, the antibody is not an antibody of WO 01/42285 which 
selectively binds the polypeptide whose sequence is identified as SEQ ID 
No 9 therein. 

25 By "selective" we include antibodies which bind at least 10-fold more 
strongly to a polypeptide of the invention than to another polypeptide; 
preferably at least 50-fold more strongly and more preferably at least 100- 
fold more strongly. Such antibodies may be made by methods well known 
in the art using the information concerning the differences in amino acid 
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sequence between the amino acid sequence which is given in Figure lb, or a 
variant thereof, and another polypeptide which is not a polypeptide of the 
invention. In particular, the antibodies may be polyclonal or monoclonal. 

5 Suitable monoclonal antibodies which are reactive as said may be prepared 
by known techniques, for example those disclosed in "Monoclonal 
Antibodies; A manual of techniques '\ H Zola (CRC Press, 1988) and in 
"Monoclonal Hybridoma Antibodies: Techniques and Application'\ SGR 
Hurrell (CRC Press, 1982). Polyclonal antibodies may be produced which 

10 are polypepcific or monospecific. It is preferred that they are monospecific. 

One embodiment provides an antibody which selectively binds the 

Kidins220 polypeptide whose amino acid is shown in Figure lb or Figure 
18 or a human equivalent thereof but does not bind a splice variant of 
15 Kidins220 as described. 

A further embodiment provides an antibody which selectively binds a 
variant of Kidins220 as described but which antibody does not bind a 
polj^eptide whose amino acid sequence is as given in Figure lb or Figure 
20 1 8 or a human equivalent thereof. 

A still fiirther embodiment provides an antibody which selectively binds a 
particular variant or particular group of variants as described. 

25 By "group of variants" is meant those variants which have some similarity 
in the way their amino acid sequence varies fi*om the amino acid sequence 
given in Figure lb. For example, the variant forms referred to as "Clone 4" 
and "Clone 5" in Figure 8 have the amino acid sequence PCDSGFNKQRQ 
inserted between residues [1178] and [1188] (as numbered in Figure lb). 
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Hence, an antibody which selectively binds to the polypeptide sequence 
PCDSGFNKQRQ will react only with the group of variant forms referred to 
as "Clone 4" and "Clone 5'' in Figure 8 and not with the amino acid " 
sequence given in Figure 1, or with any of the other variants shown in 
Figure 8. Another example of a group of variants are the amino acid 
sequence given in Figure lb and the variants referred to as Clone 3, 4, 5 and 
6 in Figure 8, excluding the variants referred to as Clone 1 and 2. Both 
"Clone 1" and "Clone 2" in Figure 8 have a deletion of amino acids 
between residues [1140] and [1191]. Hence an antibody which selectively 
binds to the polypeptide sequence FFAPYLYTPR will not bind to the 
variant forms termed Clones 1 and 2 in Figure 8, but will bind to the 
sequence given in Figure 1 and to the variant forms referred to as Clones 3, 
4, 5 and 6 in Figure 8. 

Antibodies may distinguish between the different splice variants when able 
to bind to peptides unique to the different variants. Examples of such 
unique peptides are: clone 1, HPFYNRANIN; clone 2, HPFYNRAAVP; 
clone 3, DQNNGLAAVP; clone 4, RQVQKLQAAV; clone 5, 
CHRQLTVTEF; clone 6, TEDAAEGLPS. For clones 1-3, these peptides 
represent the sequence spanning the point of fusion between the splice 
segments. The sequences suggested for clones 4-6 represent unique 
insertions in the variant. Such antibodies may be especially useful for 
monitoring a specific variant when used in conjunction with antibodies 
which recognise all forms of Kidins220 (such as one raised to the 21 amino 
acids at the C-temiinus of Kidins220) so indicating the presence of the 
protein. As is discussed in more detail below, such antibodies may also be 
useful in medicine to monitor the presence or location of one or more forms 
of Kidins220, for example, in diagnosing cancer or monitoring the 
progression of a cancer or its treatment. 
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Hence, tlie use of an antibody according to the invention in medicine, for 
example, in diagnosis, prognosis or treatment of cancer, is contemplated. 

5 The person skilled in the art can readily determine peptides which will be 
iiseful in making selective antibodies based on the information in Figures lb 
and 8 and 18. 

Further preferred antibodies are ones which distinguish Kidins220 
10 phosphoiylated at serine 919 from rat Kidins220 (or an equivalent residue) 
not phosphorylated at serine 919. Such antibodies can be made, for 
example by immimising or selecting with suitable peptides which contain 
suitable portion of Kidin220 surrounding serine 919 and wherein the serine 
residue is phosphorylated and determining whether the antibody is selective 
15 for the phosphorylated form (ie does not substantially bound to the non- 
phosphorylated form). 

Peptides in which one or more of the amino acid residues are chemically 
modified, before or after the peptide is synthesised, may be used providing 

20 that the function of the peptide, namely the production of specific antibodies 
in vivo, remains substantially unchanged. Such modifications included 
forming salts with acids or bases, especially physiologically acceptable 
organic or in organic acids and bases, forming an ester or amid of a terminal 
carboxyl group, and attaching amino acid protecting groups such as N-t- 

25 butoxycarbonyl. Such modifications may protect the peptide from in vivo 
metabolism. The peptides may be present as single copies or as multiples, 
for example tandem repeats. Such tandem or multiple repeats may be 
sufficiently antigenic themselves to obviate the use of a carrier. It may be 
advantageous for the peptide to be formed as a loop, with the N-terminal 
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V 



and C-temiinal ends joined together, or to add one or more Cys residues to 
an end to increase antigenicity and/or to allow disulphide bonds to be 
formed. If the peptide is covalently linked to a carrier, preferably a 
polypeptide, then the airangement is preferably such that the peptide of the 
5 invention forms a loop. 

According to current inamunological theories, a caixier function should be 
present in any immunogenic formulation in order to stimulate, or enhance 
stimulation of, the immune system. It is though that the best carriers 

10 embody (or, together with the antigen, create) a T-cell epitope. The 
peptides may be associated, for example by cross-linking, with a separate 
carrier, such as serum albumins, myoglobins, bacterial toxoids and keyhole 
limpit haemocyanin. More recently developed carriers which induce T-cell 
help in the immune response include the hepatitis-B core antigen (also 

15 called the nucleocapsid protein), presumed T-cell epitopes such as Thr-Ala- 
Ser-Gly-Val-Ala-Glu-Thr-Thr-Asn-Cys, p-galactosidase and the 163-171 
peptide of interleukin-1 . The latter compound may variously be regarded as 
a caixier or as an adjuvant or as both. Alternatively, several copies of the 
same or different peptides of the invention may be cross-linked to one 

20 another; in this situation there is no separate carrier as such, but a carrier 
function may be provided by such cross-linking. Suitably cross-linking 
agents include those listed as such in the Sigma and Pierce catalogues, for 
example glutaraldehyde, carbodiimide and succinimidyl 4-(N- 
maleimidomethyl)cyclohexane-l-carboxylate, the latter agent exploiting the 

25 -SH group on the C-terminal cysteine residue (if present). 

If the peptide is prepared by expression of a suitable nucleotide sequence in 
a suitable host, then it may be advantageous to express the peptide as a 
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fusion product with a peptide sequence which acts as a carrier. Kabigen's 
"Ecosec" system is an example of such an arrangement. 

Peptides may be synthesised by the Fmoc-poly amide mode of solid-phase 

5 peptide synthesis as disclosed by Lu et al (1981) J, Org. Chem. 46, 3433 
and references therein. Temporary N-amino group protection is afforded by 
the 9-fluorenylmethyloxycarbonyl (Fmoc) group. Repetitive cleavage of 
this highly base-labile protecting group is effected using 20% piperidine in 
N,N-dimethylformamide. Side-chain functionalities may be protected as 

10 their butyl ethers (in the case of serine threonine and tyrosine), butyl esters 
(in the case of glutamic acid and aspartic acid), butyloxycarbonyl derivative 
(in the case of lysine and histidine), trityl derivative (in the case of cysteine) 
and 4-methoxy-2,3,6-trimethylbenzenesulphonyl derivative (in the case of 
arginine). Where glutamine or asparagine are C-terminal residues, use is 

15 made of the 4,4'-dimethoxyben2hydryl group for protection of the side 
chain amido functionalities. The soUd-phase support is based on a 
polydimethyl-acrylamide polymer constituted from the three monomers 
dimethylacrylamide (backbone-monomer), bisacryloylefhylene diamine 
(cross linker) and acryloylsarcosine methyl ester (functionalising agent). 

20 The peptide-to-resin cleavable linked agent used is the acid-labile 4- 
hydroxjonethyl-phenoxyacetic acid derivative. AH amino acid derivatives 
are added as their preformed symmetrical anhydride derivatives with the 
exception of asparagine and glutamine, which are added using a reversed 
N,N-dicyclohexyl-carbodiimide/l-hydroxybenzotriazole mediated coupling 

25 procedure. All coupling and deprotection reactions are monitored using 
ninhydrin, trinitrobenzene sulphonic acid or isotin test procedures. Upon 
completion of synthesis, peptides are cleaved from the resin support with 
concomitant removal of side-chain protecting groups by treatment with 95% 
trifluoroacetic acid containing a 50% scavenger mix. Scavengers 
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coiximonly used are ethanedithiol, phenol, anisole and water, the exact 
choice depending on the constituent amino acids of the peptide being 
synthesised. Trifluoroacetic acid is removed by evaporation in vacuo, with 
subsequent trituration with diethyl ether affording the crude peptide. Any 
5 scavengers present are removed by a simple extraction procedure which on 
lyophiHsation of the aqueous phase affords the crude peptide free of 
scavengers. Reagents for peptide synthesis are generally available from 
Calbiochem-Novabiochem (UK) Ltd, Nottingham NG7 2QJ, UK. 
Purification may be effected by any one, or a combination of, techniques 

10 such as size exclusion chromatography, ion-exchange chromatography and 
(principally) reverse-phase high performance liquid chromatography. 
Analysis of peptides may be carried out using thin layer chromatography, 
reverse-phase high performance liquid chromatography, amino-acid 
analysis after acid hydrolysis and by fast atom bombardment (FAB) mass 

15 spectrometric analysis. 

The peptide of the invention may be linked to other antigens to provide a 
dual effect. 

20 In a preferred embodiment of the invention, an antibody of the invention is 
capable of preventing or disrupting the interaction between Kidins220 
polypeptide and Protein Kinase D. 

It will be appreciated that other antibody-like molecules may be useful in 
25 the practice of the invention including, for example, antibody fragments or 
derivatives which retain their antigen-binding sites, synthetic antibody-like 
molecules such as single-chain Fv fragments (ScFv) and domain antibodies 
(dAbs), and other molecules with antibody-like antigen binding motifs. 
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Such antibody-like molecules are included by the term **antibody" as used 



above. 



A further aspect of the invention provides a method of producing an 
5 antibody according to the ninth aspect of the invention. 

Antibodies can be raised in an animal by immunising with an appropriate 
peptide. Appropriate peptides are described herein. Altematively, with 
today*s technology, it is possible to make antibodies as defined herein 
10 without the need to use animals. Such techniques include, for example, 
antibody phage display technology as is well known in the art. Appropriate 
peptides, as described herein, may be used to select antibodies produced in 
this way. 

15 It will be appreciated that, with the advancements in antibody technology, it 
may not be necessary to immunise an animal in order to produce an 
antibody. Synthetic systems, such as phage display libraries, may be used. 
The use of such systems is included in the methods of the invention and the 
products of such systems are "antibodies" for the purposes of the invention. 



It will be appreciated that such antibodies which recognise Kidins220 and 
variants or fragments thereof are useful research reagents. Suitably, the 
antibodies of the invention are detectably labelled, for example they may be 
labelled in such a way that they may be directly or indirectly detected. 
25 Conveniently, the antibodies are labelled with a radioactive moiety or a 
coloured moiety or a fluorescent moiety, or they may be linked to an 
enzyme. Typically, the enzyme is one which can convert a non-coloured 
(or non-fluorescent) substrate to a coloured (or fluorescent) product. The 
antibody may be labelled by biotin (or streptavidin) and then detected 



20 
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10 



15 



20 



indirectly using streptavidin (or biotin) which has been labelled with a 
radioactive moiety or a coloured moiety or a fluorescent moiety, or the like 
or they may be linked to any enzyme of the tyqpe described above. 

A tenth aspect of the present invention provides a method of modulatiag the 
activity of the polypeptide whose amino acid sequence is given in Figure lb 
or a jSragment or variant or fusion thereof, the method comprising contacting 
the polypeptide with an antibody according to the seventh aspect of the 
invention. Thus, the activity of Kidins220 or a jfragment or variant thereof 
can be modulated by contacting it with a suitable antibody as defined. 

In a preferred embodiment, the antibody is specific for the form of 
Kidins220 which is non-phosphorylated on serine 919 (as numbered in 
Figure lb). Addition of such an antibody may modulate the function of the 
polypeptide of the invention by inhibiting phosphorylation of the serine at 
residue 919 by preventing access of Protein Kinase D to the 
phosphorylation site. A lack of a phosphate group on serine 919 of 
Kidins220 may cause degradation of Kidins220 by an intracellular 
mechanism such as ubiquitination and proteosomal degradation. 

In an alternative embodiment of the tenth aspect of the invention, the 
antibody may be specific for the form of Kidins220 which is not boxmd to 
ATP, or the form which is bound to ADP. Addition of such an antibody 
may modulate the function of the polypeptide of the invention by inhibiting 
the binding of ATP to the ATP-binding site. Such an inhibition may 
modulate the activity or function of the Kidins220 pol3Apeptide by 
maintaining the polypeptide in a non-active state, or a constitutively active 
form, depending on the role of ATP in the function of Kidins220. 
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In a further embodiment of the tenth aspect of the invention, the activity of 
the Kidins220 polypeptide, or a variant or fragment or fusion thereof, or a 
fusion of said fragment or variant may be modified by binding of an 
antibody to a region involved in protein-protein interactions. Such a region 

5 of the Kidins220 protein is the proline-rich region which may be involved in 
binding adaptor modules such as SH3 and WW domains, or the C-terminal 
portion with a high serine and threonine content which may participate in 
interactions with serine/threonine binding modules. Further such regions 
are the ankyrin repeat regions located at amino acid 17-391 (as numbered 

10 in Figure 18). 



An eleventh aspect of the invention comprises a method of modulating the 
function of the polypeptide whose amino acid sequence is given in Figure 
lb or a fragment or variant or fusion thereof when in the presence of Protein 

15 Kinase D, the method comprising inhibiting the phosphorylating ability of 
Protein Kinase D. Typically, PKX> is inhibited so that so that Kidins220 
remains non-phosphorylated on serine 919 when in the presence of PKD. 
This inhibition of Protein Kinase D phosphorylating activity may be 
achieved by addition of an antibody which either specifically binds the 

20 active site of Protein Kinase D, or which specifically binds a site of 
activation or regulation, such as a phosphorylation or diacylglycerol binding 
domain (Iglesias and Rozengurt (1999) FEES Lett 454:53-56), of Protein 
Kinase D, which antibody binding prevents the activation of Protein Kinase 
D necessary for its Kidins220-phosphorylating activity. A suitable target 

25 phosphorylation site on Protein Kinase D includes the two activation loop 
sites in the carboxy-terminal catalytic domain described by Iglesias et al 
(1998) J. Biol Chem. 273:27662-27667) and Waldron et al (1999). 
Another site which may be a suitable target of antibody binding in order to 
inhibit the phosphorylating activity of Protein Kinase D is the amino- 
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terminal pleckstrin homology (PH) domain of the protein that regulates its 
activity (Iglesias et al (1998) J. Biol Chem 273:410-416), 

A twelfth aspect of the invention provides a method of modulating the 
5 activity of the pol3^eptide whose amino acid sequence is given in Figure lb 
or a fragment or variant or fusion thereof, the method comprising reducing 
the ability of said polypeptide to bind ATP. 



In a preferred embodiment, the abihty of the Kidins220 polypeptide to bind 
10 to ATP is modulated by addition of a small molecule which binds the ATP- 
binding 'pocket', or a non-hydrolysable ATP analogue. Such an analogue 
is 8-azido-ATP, which in the presence of UV light covalently modifies 
ATP-binding proteins and prevents binding of a hydrolysable ATP 
molecule. 

15 

A thirteenth aspect of the invention provides a method of identifying an 
agent which modulates the promoter activity of the polynucleotide 
comprising the nucleotide sequence given in Figure 16 or a firagment or 
variant thereof, said method comprising i) introducing said polynucleotide 
20 into a suitable cell; ii) exposing a test agent to said cell and; iii) detecting 
whether (and, optionally, to what extent) the activity of the polynucleotide 
has been altered. 



From the sequence presented in Figure 16, the skilled person can identify 
25 the promoter sequence or regulatory region contained therein using no more 
than routine experimentation. 

Typically, the polynucleotide further comprises a polynucleotide capable of 
expressing a polypeptide, which polypeptide is a detectable one. Suitable 
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detectable polypeptides include green fluorescent protein, enzymes capable 
of producing coloured or detectable products from a colourless or 
undetectable substrate (for example, horseradish peroxidase), or polypeptide 
for which there is a specific antibody readily available. Agents which 
5 produce an increase in tiie amount of the polypeptide expressed are 
stimulators of the promoter activity of the polynucleotide. Agents which 
produce a decrease in the amount of polypeptide expressed are inhibitors of 
the promoter activity of the polypeptide. 

10 Suitable cells are typically neuronal cells, such as PC12 cells or other 
neuronal cell lines, but may also be primary neuronal cells. 

Agents identified by this method may be useful in enhancing (in the case of 
stimulators of promoter activity) or decreasing (in the case of inhibitors of 
15 promoter activity) the level of expression of polypeptides in neuronal cells. 
Modulation of expression of a polj^eptide may be useful in the treatment of 
neurogenerative disease where the disease arises as a result of excessive or 
inadequate levels of a polypeptide within the neuronal cells. 

20 Agents identified by this method may also be useful in manipulating the 
level of expression of polypeptides in neuronal cells in culture. 

A fourteenth aspect of the invention provides a method of identifying an 
agent which modulates the activity of the polypeptide whose amino acid 
25 sequence is given in Figure lb, or a fi:agment or variant or fusion thereof, or 
a fusion of said fragment or variant said method comprising: i) exposing a 
test agent to said polypeptide and; ii) detecting whether (and, optionally, to 
what extent) the activity of said polypeptide has been altered. 
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Typically, the polypeptide is the polypeptide whose amino acid sequence is 
given in figure lb or a variant or fusion or fragment which is a functional 
equivalent thereof. 



5 By "functional equivalent" we mean that the said component is one whose 
activity is substantially the same as the Kidins220 polypeptide activity 
which is being sought to be modified by the method, but which may be 
more convenient to use in an assay. 



10 By "determining whether (and, optionally, to what extent) said polypeptide 
has been altered" we mean that tiie activity of the polypeptide when 
exposed to the test substance is compared to the activity of the polypeptide 
when not exposed to that test substance under the same assay conditions. It ^ 
is understood that the activity may be increased, or decreased as a result of 

15 exposure to the test substance or it may not be altered. 

Typically, the activity of said Kidins220 polypeptide which is measured in 
the assay may be any one or more of its ability to interact with another 
polypeptide, its ability to become ubiquitinated, its ability to become 
20 phosphorylated, its ability to bind ATP and its abilitj^ to dissociate from 
ADP. An example of another polypeptide with which Kidins220 may 
interact is Protein Kinase D. Kidins220 may interact with TrkA and p75 
neurotrophin receptors. An example of an amino acid residue in Kidins220 
which may become phosphorylated is serine 919 as numbered in Figure lb. 

25 

Phosphorylation and ubiquitination can be measured easily using a standard 
ELISA assay with antibody against the phosphorylated form of Kidins220, 
or against ubiquitin. Protein interaction may be measured using standard 





wo 02/20786 



PCT/GBOl/03977 



52 



immunoprecipitation techniques where an antibody used in the 
immunoprecipitation is detectably labelled, or becomes detectably labelled. 

The enhancenrient or disruption or prevention of an interaction between the 
5 Kidins220 polypeptide and another polypeptide can be measured in vitro 
using methods well known in the art of biochemistry and include any 
methods which can be used to assess protein-protein interactions. Suitable 
methods include co-purification, ELISA and co-immunoprecipitation 
methods. Further suitable methods may include Scintillation Proximity 
10 Assays, as well as other techniques well known to those skilled in the art. 

The said interaction can also be measured within a cell, for example using 
the yeast two hybrid system or green fluorescent protein (GFP)-based 
systems as is well known in the art. 



In one embodiment, a host cell according to the fourth aspect of the 
invention which comprises a polynucleotide capable of expressing a 
Kidins220 polypeptide according to the fifth aspect of the invention, and a 
further polynucleotide capable of expressing Protein Kinase D may be 
20 suitable for screening for agents which enhance or dismpt or prevent an 
interaction between Kidisn220 and Protein Kinase D. 

Such agents identified by said screening method may be useful in research 
into the effect of Protein Kinase D phosphorylation in cellular function. 
25 Such agents may also be useful as medicaments for treating pathologies 
which arise due to aberrant phosphorylation by Protein Kinase D. 



15 
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Agents identified by the screening method may be useful in their own right, 
or may be useful as lead compounds for tiiie development of other 
compounds which display greater efficacy. 

A fifteenth aspect of the invention provides a method of identifying an 
agent able to modulate the activity of Protein Kinase D, the method 
comprising: i) exposing Protein Kinase D or a functional equivalent thereof 
to a peptide or polypeptide comprising the amino acid sequence 
(I/L)XRQMSJ, where X is any amino acid, J represents a hydrophobic 
residue such as isoleucine, leucine, phenylalanine, tryptophan, tyrosine or 
valine and (I/L) represents isoleucine or leucine; and ii) detecting in the 
presence or absence of the test agent whether (and, optionally, to what 
extent) said peptide or polypeptide has been phosphorylated, provided that 
the peptide is not peptide (AALVRQMSVAFFFK). 



Additionally or alternatively, the polypeptide exposed to PKD may 
comprise a polypeptide coomprising at least five consecutive residues 
including residue 919 of the polypeptide as numbered in Figure lb. 
Preferably, the peptide is one including the sequence 
RQMQRTITRQMSFDLTK or a minimum sequence thereof which can be 
phosphorylated by PKD. 

Typically, the test agent of step i) is exposed to the Kidins220 polypeptide, 
or variant or fragment or fusion or fusion of said fragment or variant, prior 
to being exposed to Protein Kinase D. 

In one embodiment, the test agent is exposed to Protein Kinase D prior to 
being exposed to the Kidins220 polypeptide, or variant or fragment or 
fusion or fusion of said fragment or variant. 
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Functional equivalents of Protein Kinase D include forms of PKD such as 
variants and fusions, as discussed above, which are capable of 
phosphorylating Kidins220. It will be appreciated that the presence of the 
5 test agent may affect the activity of Protein Kinase D either positively or 
negatively- By this we mean that the ability of Protein Kinase D to 
phosophorylate the target polypeptide may be enhanced, or it may be 
decreased. Where the phosphoylating activity of Protein Kinase D is 
enhanced in the presence of the test agent, the level of phosphorylation of 
10 the polypeptide will be greater than that achieved in tiie absence of the test 
agent. 

Detection of the presence and/or amount of phosphate associated with the 
Kidins220 polypeptide after the assay may be conducted by any convenient 
15 means. Generally detection may be accomplished by using labelled (e.g. 
radiolabelled) phosphate in free solution or attached to a substrate, and 
comparing the amount associated with the substrate before and after the 
assay. Alternatively, antibodies which distinguish the phosphorylated form 
of Kidins220 and the non-phosphorylated fomi can be used. 

20 

It will be appreciated that the invention provides screening assays for dmgs 
which may be useful in modulating tihie activity of Kidins220 or its 
interactions, or the activity of Protein Kinase D. The compounds identified 
in the methods may themselves be useful as a drug or they may represent 
25 lead compounds for the design and synthesis of more efficacious 
compounds. 

A sixteenth aspect of the invention provides a compound identifiable or 
identified by the screening methods of the invention. 
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The invention provides the use of Kidins220 {in viti'o or in vivo), or a 
fragment or variant or fusion thereof, or a insion of said variant or fragment, 
or a ftmctional equivalent thereof in the modulation of Protein Kinase D 
5 function. A composition comprising such agents is also included in the 
present invention. 

Further aspects of the invention provide agents according to the sixteenth 
aspect of the invention, polynucleotides according to the first and second 
10 aspects of the invention and polypeptides or peptides according to the 
fourth, fifth, sixth or seventh aspects of the invention for use in medicine. 

Preferable, the use in medicine is not a use in inflammation, trauma or cell 
proliferation. 

15 

Peptides based on Kidins220 as described herein which contain a 
phosphorylation site for PKD may be used to compete for PKD-mediated 
phosphorylation of Kidins220. These peptides may be used to down- 
regulate the activity of PKD on Kidins220 and other substrates of PKD. 

20 

Said agents or polynucleotides or polypeptides are useful for treatment of 
neurodegenerative disease. Neurodegenerative disease may arise due to a 
patient possessing an aberrant form of either PKD or Kidins220. 
Impairment in the function of either PKD or Kidins220 might determine 
25 failure in neuronal differentiation or failure in neuronal maintenance. 
Administration of an agent able to increase function of Kidins220, or of a 
polynucleotide or polypeptide able to restore normal Kidins220 function or 
expression level within the patient may be an effective treatment of the 
neurodegenerative disease. 
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In one embodiment, treatment of the neurodegenerative disease may be by 
gene therapy. A preferred expression vector for use in such gene therapy is 
one suitable for use in gene therapy according to the second aspect of the 
5 invention. Where it is known that the neurodegenerative disorder has arisen 
through inadequate function of PKD, an expression vector suitable for use 
in treatment of the disease may be one which expresses a fully functional 
form of PKD or a suitable equivalent thereof (for example, a fragment of 
the full length PKD polypeptide wliich is able to phosphorylate Kidins220) 
10 which is able to restore the missing PKD function to the patient. 

Gene therapy may be carried out according to generally accepted methods, 
for example, as described by Friedman, 1991. A virus or plasmid vector 
(see further details below), containing a copy of the gene linked to 

15 expression control elements and capable of replicating inside neuronal cells, 
is prepared. Suitable vectors are known, such as disclosed in US Patent 
5,252,479 and PCT published application WO 93/07282, The vector is then 
injected into the patient, eitiier locally at the site of the neurodegeneration, 
or systemically. If the transfected PKD or Kidins220 polynucleotide is not 

20 permanently incorporated into the genome of each of the targeted neuronal 
cells, the treatment may have to be repeated periodically. 



Gene transfer systems known in the art may be useful in the practice of the 
gene therapy methods of the present invention. These include viral and 
25 nonviral transfer methods. A number of viruses have been used as gene 
transfer vectors, including papovaviruses, eg SV40 (Madzak et al, 1992), 
adenovirus (Berkner, 1992; Berkner et al, 1988; Gorziglia and Kapikian, 
1992; Quantin et al, 1992; Rosenfeld et al, 1992; Wilkinson et al 1992; 
Stratford-Perricaudet et al, 1990), vaccinia virus (Moss, 1992), adeno- 



cr.W)- <v/rj npc>n7R-.AP i > 




\\6 02/20786 PCT/GBOl/03977 

57 

associated virus (Muzyczka, 1992; Ohi et al, 1990), herpesviruses including 
HSV and EBV (Margolskee, 1992; Johnson et al 1992; Fink et al, 1992; 
Breakfield and Geller, 1987; Freese et al, 1990), and retroviruses of avian 
(Brandy opadhy ay and Teniin, 1984; Petropoulos et al., 1992), murine 
5 (Miller, 1992; Miller et al, 1985; Sorge et al, 1984; Mann and Baltimore, 
1985; Miller et al, 1988), and human origin (Shimada et al, 1991; Helseth et 
al, 1990; Page et al, 1990; Buchschacher and Panganiban, 1992). To date 
most human gene therapy protocols have been based on disabled murine 
retroviruses. 

10 

Nonviral gene transfer methods known in the art include chemical 
techniques such as calcium phosphate coprecipitation (Graham and van der 
Eb, 1973; Pellicer et al, 1980); mechanical techniques, for example 
microinjection (Anderson et al, 1980; Gordon et al, 1980; Brinster et al, 

15 1981; Constantini and Lacy, 1981); membrane fusion-mediated transfer via 
liposomes (Feigner et al, 1987; Wang and Huang, 1989; Kaneda et al, 1989; 
Stewart et al, 1992; Nabel et al, 1990; Lim et al, 1992); and direct DNA 
uptake and receptor-mediated DNA transfer (Wolff et al, 1990; Wu et al, 
1991; Zenke et al, 1990; Wu et al, 1989b; Wolff al, 1991; Wagner et al, 

20 1990; Wagner et al, 1991; Gotten et al, 1990; Curiel et al, 1991a; Curiel et 
al, 1991b). 

Other suitable systems include the retroviral-adenoviral hybrid system 
described by Feng et al (1997) Nature Biotechnology 15, 866-870, or viral 
25 systems witii targeting ligands such as suitable single chain Fv fragments. 

In an approach which combines biological and physical gene transfer 
methods, plasmid DNA of any size is combined with a polylysine- 
conjugated antibody specific to the adenovirus hexon protein, and the 
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resulting complex is bound to an adenovirus vector. The trimolecular 
complex is then used to infect cells. The adenovirus vector permits efficient 
binding, internalization, and degradation of the endosome before the 
coupled DNA is damaged. 

Liposome/DNA complexes have been shown to be capable of mediating 
direct in vivo gene transfer. While in standard liposome preparations the 
gene transfer process is nonspecific, localized in vivo uptake and expression 
have been reported in tumour deposits, for example, following direct in situ 
administration (Nabel, 1992). 

Gene transfer techniques which target DNA directly to tissues, eg epithelial 
cells of the breast or ovaries, is preferred. Receptor-mediated gene transfer, 
for example, is accomplished by the conjugation of DNA (usually in the 
form of covalently closed supercoiled plasmid) to a protein ligand via 
polylysine. Ligands are chosen on the basis of the presence of the 
corresponding ligand receptors on the cell surface of the target cell/tissue 
type. One appropriate receptor/ligand pair for introduction of the 
therapeutic gene into breast tumour cells may include the estrogen receptor 
and its Hgand, estrogen (and estrogen analogues). These ligand-DNA 
conjugates can be injected directly into the blood if desired and are directed 
to the target tissue where receptor binding and internalization of the DNA- 
protein complex occurs. To overcome the problem of intracellular 
destruction of DNA, coinfection with adenovirus can be included to disrupt 
endosome function. 

A further embodiment of the invention provides a method of treating a 
neurodegenerative disease, such as failure of adrenal function, comprising 
the step of administering to the patient an effective amount of either 
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Kidins220 or PKD polypeptide or a fragment or variant or fusion thereof to 
ameliorate the neurodegenerative disease. 



Further aspects of the invention provide a pharmaceutical composition 
5 comprising an agent identified or identifiable by the methods of the 
sixteenth aspect of the invention or a polynucleotide according to the first or 
second aspects of the invention or a polypeptide according to the fifth, sixth 
or seventh aspects of the invention and a phatmaceutically acceptable 
carrier. 

10 

By "pharmaceutically acceptable" is included that the fomiulation is sterile 
and pyrogen free. Suitable pharmaceutical carriers are well known in the art 
of pharmacy. 

15 Further aspects of the invention provide kits of parts that are useful in 
carrying out the screening methods of the invention or for phosphorylating a 
polypeptide of the invention by Protein Kinase D. Such a kit may be useful 
in research where the effect of phosphorylation of serine 919 in Kidins220 
is of interest in investigating the cell biology of a disease state. 

20 

Thus, one kit includes Protein Kinase D or a functional equivalent thereof 
and a polypeptide whose amino acid sequence is known in Figure lb or a 
fragment or variant or fusion thereof. A preferred variant is the polypeptide 
whose amino acid sequence is shown in Figure 18. 

25 

Preferably, the Kidins220-related polypeptide present in the kit comprises 
an amino acid sequence of at least five consecutive amino acids including 
serine 919 as numbered in Figure lb. Typically, however, the kit contains a 
polypeptide which contains the amino acid sequence 
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RQMQRTITRQMSFDLTK. The kit may additionally or alternatively 
contain a polypeptide comprising the amino acid sequence (I/L)XRQMSJ, 
where X is any amino acid, J represents a hydrophobic residue such as 
isoleucine, leucine, phenylalanine, tryptophan, tyrosine or valine and (I/L) 
represents isoleucine or leucine. 

The Protein Kinase D of the kit may be a constitutively active form of 
Protein Kinase D, which may be useful in research into the effects of 
aberrant phosphorylation of Kidins220. A constitutively active form may 
be made by a double mutation of Protein Kinase D as described above. 

The kit may, or may not further comprise a means of detecting 
phosphorylation of a protein. Such a means may be by provision of a 
labelled (eg radiolabelled) phosphate compound such as ATP, suitable as a 
phosphate donor in a phosphorylation reaction, or an antibody which 
specifically binds the phosphoiylated form of the Kidins220 polypeptide 
and does not bind the non-phosphoiylated polypeptide. Conveniently, the 
antibody would be labelled with a detectable group as described above. 
Conveniently, the kit comprises an appropriate means for detecting the 
label, such as autoradiograph film or an enzyme-hnked chemiluminescent 
assay. Some of the kit components may be locahsed on a surface, such as a 
blotting membrane, or an assay plate for EUSA etc, although the assay can 
be carried out in solution. 

A still further aspect of the invention provides a mutant animal wherein if 
the organism contains in its genome a polynucleotide which encodes a 
polypeptide comprising the amino acid sequence in Figure lb or a fragment 
or a variant thereof the polynucleotide in the mutant organism is mutated or 
absent. 
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Such mutant animal can be made by gene knockout and gene replacement 
technology as is well Icaown in the art. The organism may be any suitable 
organism such as Drosophila or a rodent. Preferably, however, the 

5 organism is a non-human mammal such as a rodent such as a rat or mouse. 
A suitable method of making such mutants in C.elegans is given in Kraemer 
et al (1999), Cwrr. BioL 9:1009-1018, and in mice in Torres and Kuehn, 
(1997) Laboratory Protocols for Conditional Gene Targeting^ Oxford 
University Press. The method of RNA interference for gene silencing is 

10 described in Bosher and Labouesee (2000) Nature Cell BioL 2:E3 1-E36. 

A yet further aspect of the invention provides a method of diagnosing 
cancer in a patient comprising the steps of 



(ii) comparing the level of expression of step (i) with that in a 
control sample. 

The level of expression of the Kidins220 gene in the sample may be 
20 determined by any convenient means. For example, the amount of mKNA 
encoding the Kidins220 polypeptide can be detected using detectably 
labelled polynucleotide probes which are capable of selectively hybridising 
to the mRNA. Suitable polynucleotide probes are those according to the 
present invention which are described in more detail above. Alternatively 
25 or additionally, the amount of the Kidins220 polypeptide may be detected 
using molecules which are capable of selectively binding to the Kidins220 
polypeptide. Suitable molecules include antibodies selective for Kidins220 
of the present invention, such as those described in more detail above. 



15 



(i) 



detemiining the level of expression of the Kidins220 gene in a 
sample from the patient containing protein or nucleic acid; 
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According to one embodiment of this aspect of the invention, the 
subcellular location of the Kidins220 gene product is also determined in 
addition to the level of expression- 

5 A change in the cellular location of the Kidins220 gene product (tiie 
Kidins220 polypeptide) from the plasma membrane to tihe nucleus or 
cytoplasm may indicate that the cell is a cancerous or pre-cancerous one. 
The change may be observed as an increase in the proportion of nuclear or 
cytoplasmic Kidins220 polypeptide compared to a control sample, or as a 
10 presence of nuclear or cytoplasmic Kidins220 compared to an absence of 
nuclear or cytoplasmic Kidins220 in the control sample. 



By "cellular location" we include the meaning of which cellular 
compartment the polypeptide is observed to be in. The term "cellular 

15 compartment" is well understood in the art, and includes the plasma 
membrane, the cytosol, the nucleus, endoplasmic reticulum, Golgi complex, 
peroxisome and so on. The presence of nuclear or cytoplasmic Kidins220 
polypeptide, or an increase in the proportion of Kidins220 polypeptide in 
the nucleus or cytoplasm, may, in addition to an increase in the expression 

20 level of the Kidins220 gene, further indicate that the cancer is progressing 
or developing into a more advanced cancerous stage. In this embodiment, it 
may be useful to compare the cellular location of the Kidins220 polypeptide 
to that of a control sample. 

25 Methods of determining cellular location for a polypeptide are known in the 
art of cell biology, and include immunofluorescence and confocal 
microscopy, and subcellular fractionation. Methods for quantitating the 
(relative) amounts of polypeptide in specific subcellular locations are also 
known, and include the use of computer software packages which analyse 
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and quantitate the data produced by, for example, labelled antibodies in 
immunofluorescence. 

According to a further embodiment, the phosphorylation status of the 
Kidins220 polypeptide in the sample from the patient is also determined in 
addition to the level of expression and compared to the phosphorylation 
status in a control sample. As discussed above and in Kong et al (2001) J. 
Neurosci. 21:176-185, the Kidins220 polypeptide may be phosphorylated on 
several residues, including on serine 919 (or its equivalent at serine 918 
shown in Figure 18) and on at least one tyrosine. A change in tlie 
phosphorylation status of the Kidins220 polypeptide in a sample from the 
patient (for example, an increase or a decrease in the total number of 
phosphoiylated residues or a change in the phosphorylation of any one or 
more residues) compared to the phosphorylation status of Kidins220 in a 
control sample may be indicative of a cancerous or pre-cancerous state. 

By "phosphorylation status is determined" we include the meaning that the 
number of phosphorylated residues in Kidins220 is determined, and/or the 
type of residues (for example, tyrosine or serine or threonine) which are 
20 phosphorylated and/or the presence or absence of a phosphate group on a 
specific residue is determined. 

Methods for detecting phosphorylation on amino acid residues are well known in 
the art, and include the use of anti-phosphopeptide antibodies, for example, the 
25 anti-phosphotyrosine antibody pY99 available from Santa Cruz Biotechnology, 
Santa Cruz, OA, or anti-phosphoserine antibodies which selectively bind 
phosphorylated serine 919 (or its equivalent) as described above. 
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Glioblastomas are a less differentiated type of astrocytic tumour, and 
originate from astrocytomas and anaplastic astrocytomas undergoing 
neoplastic progression. The lack of differentiation means a particularly 
unfavourable prognosis and makes histopathological classification and 
grading particularly difficult. Routine treatment for glioblastoma 
multiforme involves surgical tumour reduction and cycles of radiation and 
chemotherapeutic treatment; however, these are mostly ineffective. The 
Kidins220 polypeptide is over-expressed in the cells of a variety of tumours, 
especially in glioblastoma multiforme and prostate cancers. Hence, 
detecting over-expression of the Kidins220 polypeptide, or detecting over- 
expression of the encoding polynucleotide may be indicative of a cancerous 
or precancerous state in those cells and allow diagnosis, or aid in the 
diagnosis, of cancer, especially prostate cancers and cancers of astrocytic 
origin such as glioblastoma multiforme. Early, and correct diagnosis of 
glioblastoma multiforme in particular offers the possibility for a prompt and 
suitable treatment which may produce a more favourable outcome. 

The sample may be any suitable sample which includes protein or nucleic 
acid as required. The sample may be directly derived from the patient, for 
example, by biopsy of a tissue which may be associated with aberrant 
vascular development, or it may be derived from the patient from a site 
remote from the tissue, for example because cells from the tissue have 
migrated from the tissue to other parts of the body (eg tumour tissue may be 
taken from the primary tumour or from metastases). Alternatively, the 
sample may be indirectly derived from the patient in the sense that, for 
example, the tissue or cells therefrom may be cultivated in vitro^ or 
cultivated in a xenograft model; or the nucleic acid sample may be one 
which has been replicated (whether in viti'o or in vivo) from nucleic acid 
from the original source from the patient. Thus, although the nucleic acid 
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derived from the patient may have been physically within the patient, it may 
alternatively have been copied from nucleic acid which was physically 
within the patient. 

Where the cellular location of the Kidins220 gene product is also determined, the 



Where the phosphorylation status is determined, the sample from the 
patient is preferably one which contains protein. 

The term "over-expression" is well understood in the art, and we include the 
meaning that the level of expression is detectably higher than in a control 
sample, as defined below. 

By "control sample'" we include a sample from a tissue which is not 
believed to be cancerous. The tissue may, or may not, be from the same 
individual who is the patient being diagnosed. Where the tissue is from the 
same individual who is being diagnosed it is preferred if the tissue is not 
one in which cancer has been suspected or found. 

A fiirther aspect of the invention provides a method of monitoring the 
progression of cancer in a patient comprising the steps of 

(i) determining the level of expression of the Kidins220 gene in a 
sample from the patient containing protein or nucleic acid; 

(ii) comparing the level of expression of step (i) with that in a 
previous sample from the same patient. 

By "Kidins220 gene" we mean a gene which encodes a polypeptide of the 
invention. For example the Kidins220 gene is one which encodes the 



sample from the patient is preferably one which contains cells. 
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polypeptides whose sequence is shown in Figures lb, 11 and 18 and 
variants thereof. 

Over-expression of the Kidins220 gene may be indicative of a cancerous or 
pre-cancerous state. Hence, an increase in the expression level (such as 
over-e?q)ression or greater degree of over-expression) may indicate that the 
tumour is growing or developing into a more advanced cancerous stage. 

As discussed above in relation to the method for diagnosing cancer, the 
level of gene expression in the sample may be determined by any 
convenient technique known in the art. 

According to one embodiment of this aspect, the cellular location of the 
Kidins220 gene product (Kidins220 polypeptide) is also determined in 
addition to the level of expression and compared to a previous sample. 

Methods of determining the location and amount of polypeptide are 
discussed above. 

According to another embodiment of this aspect, the phosphorylation status 
of the Kidins220 polypeptide is determined in addition to the level of 
expression, and compared to a previous sample from the patient. Methods 
of determining the phosphorylation status of Kidins220 are discussed above. 
A change in the phosphorylation status (as defined above) may indicate a 
progression or development in the cancer. 

Hence, the invention provides a use for an antibody capable of selectively 
binding to the Kidins220 polypeptide in medicine, and especially in the 
manufacture of a diagnostic reagent for diagnosing or prognosing cancer. 
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As is discussed above, Kidins220 has several splice variants. Hence, the 
choice of complementary polynucleotide or antibody which is to be used in 
determining the level of Kidins220 polynucleotide or polypeptide in a 
sample may depend on the particular splice variant which it is desired to 
detect, or may be designed such that it does not discriminate between 
Kidins220 splice variants. 

Although as discussed above determination of phosphorylation status or 
detennination of cellular location may be done in combination with the 
determination of the level of expression of the Kidins220 gene, they may 
suitably be done without determining the level of expression. 

Thus, a further aspect of the invention provides a method of diagnosing 
cancer in a patient comprising the steps of 

(i) determining the phosphor>4ation status of the Kidins220 
polypeptide in a sample from the patient containing protein; 

(ii) comparing the phosphorylation status of step (i) with that in a 
control sample. 

A yet further aspect of the invention provides a method of diagnosing 
cancer in a patient comprising the steps of 

(i) determining the cellular location of the Kidins220 polypeptide 
in a sample from the patient containing cells; 

(ii) comparing the cellular location of step (i) with that in a 
control sample. 
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As discussed above, the cellular location of the Kidins220 polypeptide may 
change in a cancerous or pre-cancerous cell, and may move from the plasma 
membrane to the cytoplasm and/or the nucleus. 

5 A still further aspect of the invention provides a method of monitoring the 
progression of cancer in a patient comprising the steps of 

(i) determining the phosphorylation status of the Kidins220 
polypeptide in a sample from the patient containing protein; 

(ii) comparing the phosphorylation status of step (i) with that in a 
10 control sample. 



A still further aspect of the invention provides a method of monitoring the 
progression of cancer in a patient comprising the steps of 

(i) determining the cellular location of the Kidins220 polypeptide 
in a sample from the patient containing cells; 

(ii) comparing the cellular location of step (i) with that in a 
control sample. 

As discussed above, the cellular location of the Kidins220 polypeptide may 
change in a cancerous cell, and the polypeptide may move from the plasma 
membrane to the c3^oplasm and/or the nucleus, or may no longer be 
targeted to the plasma membrane during expression of the encoding 
polynucleotide and instead be directed to the cytoplasm and/or the nucleus. 
Such a change may be indicative of a change in the development stage of 
the cancer, and thereby provide an indication of the degree success or 
failure of a treatment regime being administered to the patient following an 
earlier diagnosis of cancer. 
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Methods of determining cellular location and the phosphorylation status of 
Kidins220 are discussed above. 

Suitable samples and control sample are as described above. 

It will be appreciated that in the above methods of diagnosis of cancer or of 
monitoring the progression of cancer, it may be useful to determine the 
cellular location or phosphorylation status of a specific variant of 
Kidins220. Such variants are described above. Preferred variants have the 
sequence shown in Figure 18 and/or any of the splice forms shown in 
Figure 8. 

Another aspect of the invention provides a method of treating cancer in a 
patient comprising administering to the patient an agent which inhibits 
function of the Kidins220 gene or its product. 

The agent which inhibits the fimction of the Kidins220 gene may be any 
convenient agent. The agent may inhibit the function of the Kjidins220 gene 
at one or more of several levels. For example, it may inhibit transcription of 
the gene or translation of the mRNA or it may inhibit the function of the 
protein product. An example of a suitable agent includes a polynucleotide 
which is antisense to a polynucleotide according to the first or third aspect 
of the invention. 

Conveniently, the polynucleotide is antisense to a polynucleotide encoding 
a polypeptide according to the fifth aspect of the invention. 

Antisense oligonucleotides are single-stranded nucleic acids, which can 
specifically bind to a complementary nucleic acid sequence. By binding to 
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the appropriate target sequence, an RNA-RNA, a DNA-DNA, or RNA- 
DNA duplex is formed. These nucleic acids are often termed "antisense'* 
because they are complementary to the sense or coding strand of the gene. 
Recently, formation of a triple helix has proven possible where the 
oligonucleotide is bound to a DNA duplex. It was found that 
oligonucleotides could recognise sequences in the major groove of the DNA 
double helix. A triple helix was formed thereby. This suggests that it is 
possible to synthesise a sequence-specific molecules which specifically bind 
double-stranded DNA via recognition of major groove hydrogen binding 
sites. 

By binding to the target nucleic acid, the above oligonucleotides can inhibit 
the function of the target nucleic acid. This could, for example, be a result 
of blocking the transcription, processing, poly(A)addition, rephcation, 
translation, or promoting inhibitory mechanisms of the cells, such as 
promoting RNA degradations. 

Antisense oligonucleotides are prepared in the laboratory and then 
introduced into cells, for example by microinjection or uptake from the cell 
culture medium into the cells, or they are expressed in cells after 
transfection with plasmids or retroviruses or other vectors carrying an 
antisense gene. Antisense oligonucleotides were first discovered to inhibit 
viral replication or expression in cell culture for Rous sarcoma vims 
vesicular stomatitis virus, herpes simplex virus type 1, simian virus and 
iiifluen2a virus. Since then, inhibition of mRNA translation by antisense 
oligonucleotides has been studied extensively in cell-free systems including 
rabbit reticulocyte lysates and wheat germ extracts. Inhibitiou of viral 
function by antisense oligonucleotides has been demonstrated in vitro using 
oligonucleotides which were complementary to the AIDS HIV retrovirus 
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RNA (Goodchild, J. 1988 "Inhibition of Human Immunodeficiency Virus 
Replication by Antisense Oligodeoxynucleotides", Proc. Natl, Acad. Set 
(USA) 85(15), 5507-11). The Goodchild study showed that 
oligonucleotides that were most effective were complementary to the 
5 poly (A) signal; also effective were those targeted at the 5' end of the RNA, 
particularly the cap and 5' untranslated region, next to the primer binding 
site and at the primer binding site. The cap, 5' untranslated region, and 
poly(A) signal lie within the sequence repeated at the ends of retrovims 
RNA (R region) and the oligonucleotides complementary to these may bind 
10 twice to the RNA. 



Typically, antisense oligonucleotides are 15 to 35 bases in length. For 
example, 20-mer oligonucleotides have been shown to inhibit the 
expression of the epidemial growth factor receptor mRNA (Witters et al^ 

15 Breast Cancer Res Treat 53:41-50 (1999)) and 25-mer oligonucleotides 
have been shown to decrease the expression of adrenocorticotropic hormone 
by greater than 90% (Frankel et al, J Neurosurg 91:261-7 (1999)). 
However, it is appreciated that it may be desirable to use oligonucleotides 
with lengths outside this range, for example 10, 11, 12, 13, or 14 bases, or 

20 36, 37, 38, 39 or 40 bases. 

Antisense polynucleotides may be administered systemically. Altematively 
the inherent binding specificity of polynucleotides characteristic of base 
pairing is enhanced by limiting the availability of the polynucleotide to its 
25 intended locus in vzvo, permitting lower dosages to be used and minimising 
systemic effects. Thus, polynucleotides may be applied locally to achieve the 
desired effect. The concentration of the polynucleotides at the desired locus is 
much higher than if the polynucleotides were administered systemically, and 
the therapeutic effect can be achieved using a significantly lower total amount. 
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The local high concentration of polynucleotides enhances penetration of the 
targeted cells and effectively blocks translation of the target nucleic acid 
sequences. 



5 The antisense polynucleotides can be delivered to the locus by any means 
appropriate for localised administration of a drug. For example, a solution of 
the antisense polynucleotides can be injected directly to the site or can be 
delivered by infusion using an infiision pump. The antisense polynucleotides 
also can be incorporated into an implantable device which when placed 
10 adjacent to the desired site, to permit the antisense polynucleotides to be 
released into the surrounding locus. 



The antisense pol]/nucIeotides may be administered via a hydrogel material. 
The hydrogel is non-inflammatory and biodegradable. Many such materials 

15 now are known, including those made from natural and synthetic polymers. 
In a preferred embodiment, the method exploits a hydrogel which is liquid 
below body temperature but gels to form a shape-retaining semisolid hydrogel 
at or near body temperature. Preferred hydrogel are polymers of ethylene 
oxide-propylene oxide repeating units. The properties of the polymer are 

20 dependent on the molecular weight of the polymer and the relative percentage 
of polyethylene oxide and polypropylene oxide in the polymer. Preferred 
hydrogels contain from about 10% to about 80% by weight ethylene oxide 
and from about 20% to about 90% by weight propylene oxide. A particularly 
preferred hydrogel contains about 70% polyethylene oxide arid 30% 

25 polypropylene oxide. Hydrogels which can be used are available, for 
example, from BASF Corp., Parsippany, NJ, under the tradename Pluronic^. 

In this embodiment, the hydrogel is cooled to a liquid state and the 
oligonucleotides are admixed into the liquid to a concentration of about 1 
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mg polynucleotides per gram of hydrogel. The resulting mixture then is 
applied onto the surface to be treated, for example by spraying or painting . 
during surgery or using a catheter or endoscopic procedures. As the 
polymer wamis, it solidifies to fomi a gel, and the antisense polynucleotides 
5 diffuse out of the gel into the surrounding cells over a period of time 
defined by the exact composition of the geh 

The polynucleotides can be administered by means of other implants that are 
conmmercially available or described in the scientific literature, including 

10 liposomes, microcapsules and implantable devices. For example, implants 
made of biodegradable materials such as polyanhydrides, polyorfhoesters, 
polylactic acid and polyglycolic acid and copolymers thereof, collagen, and 
protein polymers, or non-biodegradable materials such as ethylenevinyl : 
acetate (EVAc), polyvinyl acetate, ethylene vinyl alcohol, and derivatives 

15 thereof can be used to locally deliver the polynucleotides. The 
polynucleotides can be incorporated into the material as it is polymerised or 
solidified, using melt or solvent evaporation techniques, or mechanically 
mixed with the material. In one embodiment, the polynucleotides are mixed 
into or applied onto coatings for implantable devices such as dextran coated 

20 silica beads, stents, or catheters. 

The dose of polynucleotides is dependent on the size of the polynucleotides 
and the purpose for which is it administered, hi general, the range is 
calculated based on the surface area of tissue to be treated. The effective dose 
25 of polynucleotides is somewhat dependent on the length and chemical 
composition of the polynucleotides but is generally in die range of about 30 to 
3000 |Lig per square centimetre of tissue surface area. 
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The polynucleotides may be administered to the patient systemically for 
cosmetic, therapeutic and prophylactic purposes. The polynucleotides may be 
administered by any effective method, for example, parenterally (eg 
intravenously, subcutaneously, intramuscularly) or by oral, nasal or other 
5 means which permit the oligonucleotides to access and circulate in the 
patient's bloodstream. Polynucleotides administered systemically preferably 
are given in addition to locally administered polynucleotides, but also have 
utility in flie absence of local administration. A dosage in the range of from 
about 0.1 to about 10 grams per administration to an adult human generally 
10 will be effective for this purpose. 

It will be appreciated that antisense agents also include larger molecules 
which bind to said Kidins220 mRNA or genes and substantially prevent 
expression of said Kidins220 mRNA or genes and substantially prevent 
15 expression of said protein. Thus, expression of an antisense molecule which 
is substantially complementary to said Kidins220 mRNA is envisaged as part 
of the invention. 

The said larger molecules may be expressed from any suitable genetic 
20 constmct and delivered to the patient. Typically, the genetic construct which 
expresses the antisense molecule comprises at least a portion of the said 
Kidins220 cDNA or gene operatively linked to a promoter which can express 
the antisense molecule in the cell. 

25 Although genetic constmcts for delivery of polynucleotides can be DNA or 
RNA it is preferred if it is DNA. Equivalent genetic constructs can be used 
to deliver antisense polynucleotides to a patient as described above in 
relation to the delivery of polynucleotides encoding Kidins220 polypeptide 
or variants thereof. 
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Preferably, the genetic construct is adapted for delivery to a human cell. 

Preferably, the antisense molecule is one which is capable of decreasing or 
5 inhibiting the function of the Kidins220 gene. Any decrease in the function 

of the said gene, for example the expression of the gene, is preferably a 

reduction of at least 10%, 20%, 30%, 40%, 50% of the level observed. 

More preferably, the function is reduced by 60%, 70%, 80% or 90%. 

Preferably, the reduction in function is such that the Kidins220 gene is 
10 expressed to the same level as observed in tissues in the same patient which 

are not cancerous or pre-cancerous. 

hi a preferred embodiment, the polynucleotide which is antisense further 
comprises a vector which is designed to express antisense DNA, Hence, the 
15 invention further provides a polynucleotide comprising a nucleic acid 
sequence which is antisense to a polynucleotide encoding the Kidins220 
polypeptide for use in medicine, especially in the manufacture of a 
medicament for treating cancer. 

20 Further examples of agents which inhibit the function of the Kidins220 gene 
or its product include antibodies capable of selectively binding the 
Kidins220 polypeptide or a variant thereof, and ribozyme molecules which 
selectively cleave polynucleotides encoding the Kidins220 polypeptide. 
Suitable antibodies include those of the invention as described above which 

25 interfere with the function of the Kidins220 pol)T)eptide, for example, by 
binding the serine at residue 919 (or its equivalent) and preventing 
phosphorylation or dephosphorylation, or by binding a protein interaction 
domain and thereby preventing a protein-protein interaction required for a 
function of the Kidins220 polypeptide. Methods of producing ribozymes 
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capable of selectively cleaving polynucleotides are well known in the airt. 
Suitable ribozymes can be developed with only routine experimentation. 

The cancer refeixed to in any one of the preceding methods of diagnosing, 
5 prognosing or treating cancer or monitoring the progression of cancer in a 
patient may be any type of neoplasm or tumour. Preferably, the cancer is 
one of astrocytic origin, such as glioblastoma multiforme, or the cancer is a 
prostate cancer. The level of Kidins220 expression is known to be 
especially elevated in cells from these types of cancers. 

10 

The invention further provides an agent capable of inhibiting the function of 
the Kidins220 gene or its product, an antibody according to the invention, 
and a polynucleotide which is antisense to a polynucleotide according to the 
first or third aspects of the invention, for use in medicine. 

15 

The invention also provides the use of an agent capable of inhibiting the 
function of the Kidins220 gene or its product in the manufacture of a 
medicament for treating cancer. Preferably, the agent isr an antibody 
according to the invention or a polynucleotide which is antisense to a 
20 polynucleotide according to the first or third aspects of the invention. 

The invention further provides the use of an agent which is capable of 
detecting the expression of the Kidins220 gene in the manufacture of a 
diagnostic reagent for diagnosing or prognosing cancer or for monitoring 
25 the progression of cancer in a patient. Preferably, the agent is an antibody 
according to the invention or a polynucleotide according to the first aspect 
of flie invention. 
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The uses discussed above may be in respect of any cancer. Preferably, the 
cancer is glioblastoma multiforme or prostate cancer. 

A further aspect of the present invention provides compounds comprising 
5 (i) a moiety which selectively binds the Kidins220 polypeptide and (ii) a 
further moiety. 

According to a preferred embodiment, the moiety capable of selectively 
binding to the Kidisn220 polypeptide is an antibody, preferably an antibody 
10 of the present invention as described above. 



In an alternative embodiment, the moiety capable of selectively binding to 
the Kidins220 polypeptide is a peptide. As discussed above, it is known 
from Kong et al (2001) J. NeiiroscL 21:176-185 that Kidins220 physically 
15 associates with the TrkA and p75 neurotrophin receptors. 



Any cognate ligand for Kidins220 which is capable of selectively binding a 
region of the polypeptide which is located extracellularly may be useful. 
The extracellular regions of Kidins220 are likely to be located within the 

20 residues located between transmembrane domains 1 and 2 and between 
domains 3 and 4. These extracellular regions may be represented by amino 
acids 521-525 and 661-688 of the Kidins220 polypeptide sequence given in 
Figure lb, or as described in Kong et al (2001) (supra). Any cognate 
ligands for Kidins220 will be a suitable moiety for selectively binding 

25 Kidins220. Peptides binding Kidins220 can be identified by means of a 
screen. A suitable method or screen for identifying peptides or other 
molecules which selectively bind Kidins220 may comprise contacting the 
Kidins220 polypeptide with a test peptide or other molecule under 
conditions where binding can occur, and then determining if the test 
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molecule or peptide has bound Kidins220. Methods of detecting binding 
between two moieties are well known in the art of biochemistry. 
Preferably, the known technique of phage display is used to identify 
peptides or other ligand molecules which bind to Kidins220. An alternative 
5 method includes the yeast two hybrid system. 

Peptides or other agents which selectively bind the Kidins220 polypeptide 
include those which modulate or block the function of Kidins220. 

10 Suitable peptides may be synthesised as described in more detail above. 

The further moiety may be any further moiety which confers on the 
compound a useful property with respect to the treatment or imaging or 
diagnosis of diseases. Such diseases or other conditions or states are 
15 described in more detail below. In particular, the further moiety is one 
which is useful in killing tumour cells or imaging cancer. Preferably, the 
further moiety is one which is able to kill the cancer cells to which the 
compound is targeted. 

20 In a preferred embodiment of the invention the further moiety is directly or 
indirectly cytotoxic. In particular the further moiety is preferably directly or 
indirectly toxic to cells in a tumour or cells which are in close proximity to 
and associated with a tumour. 

25 Further embodiments and examples of further moieties which may be useful 
in the compound of the invention are described in more detail in WO 
00/64480 (incorporated herein by reference). 
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Alternatively, the compound may be produced as a fusion compound by 
recombinant DNA techniques whereby a length of DNA comprises respect- 
ive regions encoding tlie two moieties of the compound of the invention 
either adjacent one another or separated by a region encoding a linker 
5 peptide which does not destroy the desired properties of the compound. 
Conceivably, the two portions of the compound may overlap wholly or 
partly. 

The DNA is then expressed in a suitable host to produce a polypeptide 
10 comprising the compound of the invention. 

The invention also provides a kit of parts (or a therapeutic system) 
comprising (1) a compound of the invention wherein the further moiety 
which is able to convert a relatively non-toxic prodrug into a cytotoxic drug 
15 and (2) a relatively non-toxic prodrug. Tlie kit of parts may comprise any 
of the compounds of the invention and appropriate prodrugs as herein 
disclosed. 

The invention also provides a kit of parts (or a therapeutic system) 
20 comprising (1) a compound of the invention wherein the further moiety is 
able to bind selectively to a directly or indirectly cytotoxic moiety or to a 
readily detectable moiety and (2) any one of a directly or indirectly 
cytotoxic or a readily detectable moiety to which the further moiety of the 
compound is able to bind. 

25 

In a further embodiment, the binding moiety may be comprised in a 
delivery vehicle for delivering nucleic acid to the target. The delivery 
vehicle may be any suitable delivery vehicle. 
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Alternatively^ the further portion may comprise a polypeptide or a 
polynucleotide encoding a polypeptide which is not either directly or 
indirectly cytotoxic but is of therapeutic benefit, 

5 In a further embodiment of the invention, the further moiety comprised in 
the compound of the invention is a readily detectable moiety. 



By a "readily detectable moiety" we include the meaning that the moiety is 
one which, when located at the target site following administration of the 
10 compound of the invention into a patient, may be detected, typically non- 
invasively from outside the body and the site of tiie target located. Thus, 
the compounds of this embodiment of the invention are useful in imaging 
and diagnosis. 

15 The compounds of the final aspect of the invention are use&l in treating, 
imaging or diagnosing disease, particularly cancer and neurodegenerative 
conditions. 

Thus, the invention further provides a method of imaging cancer in the body 
20 of an individual the method comprising administering to the individual an 
effective amount of a compound according to the invention wherein the 
further moiety is a readily detectable moiety. 

In a preferred embodiment of this aspect of the invention, the method of 
25 imaging cancer in an individual comprises the further step of detecting the 
location of the compound in the individual. 

Detecting the compound or antibody can be achieved using methods well 
known in the art of clinical imaging and diagnostics. The specific method 
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required will depend on the type of detectable label attached to the 
compound or antibody. For example, radioactive atoms may be detected ^ 
using autoradiography or in some cases by magnetic resonance imaging 
(MRI) as described above. 

5 

The invention further provides a method of diagnosing or prognosing cancer 
in an individual the method comprising administering to the individual an 
effective amount of a compound according to the invention wherein the 
further moiety is a readily detectable moiety. 

10 

The method may be one which is an aid to diagnosis. 

In a preferred embodiment of this aspect of the invention, the method of 
diagnosing, or aiding diagnosis of, cancer in an individual comprises the 
15 further step of detecting the location of the compound in the individual. 

The invention still further provides a method of treating an individual in need of 
treatment, the method comprising administering to the individual an effective 
amount of a compound according to the invention wherein the further moiety is a 
20 cytotoxic or therapeutic moiety. 

Typically in the above-mentioned methods of treatment, the further moiety 
is one which destroys or slows or reverses the growth of the cancer. 

25 For example, in the case where the compound of the invention comprises a 
readily detectable moiety or a directly cytotoxic moiety, it may be that only 
the compound, in a suitable formulation, is administered to the patient. Of 

course, other agents such as immunosuppressive agents and the like may be 
administered. 
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In respect of compounds which are detectably labelled, imaging takes place 
once the compound has localised at the target site. 

5 However, if the compound is one which requires a further component in 
order to be useful for treatment, imaging or diagnosis, the compound of the 
invention may be administered and allowed to localise at the target site, and 
then the further component administered at a suitable time thereafter. 

10 In a preferred embodiment of these aspects of the invention, the compounds 
are suitable for use in medicine. 

The invention further contemplates a pharmaceutical composition 
comprising a compound of the invention and a pharmaceutically acceptable 
15 carrier. 

A compound of the invention may be used in the manufacture of an agent 
for imaging cancer in a body of an individual or a diagnostic or prognostic 
agent for cancer or in the manufacture of a medicament for treating cancer. 



20 



The invention will now be described in more detail with reference to the 
following Figures and Examples: 
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Figure 1. 

Cloning and tissue distribution of Kidins220, a novel PKD-interacting 
protein. 

(A) Immunoprecipitation with anti-PKD/PKCfx antibodies was carried out 
on PC12 cells treated (+) or untreated (-) with 200 nM PDB and the 
immunocomplexes were tested in an in vitro kinase assay as described in 
Experimental Procedures in Example 1. The samples were analysed by 
SDS-PAGE and autoradiography. The radioactive bands corresponding to 
PKD and to a 220 kDa protein (p220) are indicated. 

(B) Primary sequence of Kidins220. The predicted protein sequence 
corresponding to the complete ORF of Kidins220 cDNA is shown. The 
three peptides obtained by mass spectroscopy are double underlined and 
correspond to EST sequences AA998517 (rat), N83544 (human) and 
AA685317 (rat), respectively. The N-terminal 11 ankyrin repeats (AA 37- 
69, 70-102, 103-135, 137-169, 170-202, 203-235, 236-268, 269-301, 302- 
334, 335-367, 368-402; grey boxes), the ATP-binding motif (AA 467-474; 
bold underlined), the four transmembrane domains (AA 500-520, 526-546; 
660-680 and 689-709; dark boxes), serine 919 (circled), and the splicing 
cassette (arrowheads) are indicated. The nucleotide sequence data are to be 
made available from GenBank/EMBL/DDBJ under the accession number 
AF239045. 

(C) Schematic representation of the stmcture of rat Kidins220 and similarity 
(grey boxes) with the C. elegans (F36H1.2) and the D. melanogaster 
(AC007175) orthologues. 

OD) Northern blots with 2 |Ag of poly(A)+ RNA from each of the indicated 
tissues (left) and 10 ^ig of total RNA from each cell line (right) were 
hybridised with a random-primed frill length Kidins220 cDNA probe. Sizes 
of RNA markers (in kb) are indicated. 
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Figure 2. 

Kidins220 is an ATP-binding integral membrane protein 

(A) 50 jig of PC 12 lysates were analysed by Western blot with an antibody 
against a carboxy-terminal Kidins220 peptide (AA 1747-1763). The 

5 antibody was used directly in Western blot, either without (-) or with (+) 
pre-incubation with the inunnnising carboxy-terminal peptide or in 
immunoprecipitation followed by Western blot (ip). The grey arrowhead 
indicates the position of the IgG heavy chain. 

(B) Kidins220 and PKD immnnoprecipitates were incubated in the dark (-) 
10 or under u.v. light (+) with 8-azido [^^P] a-ATP as described in the 

Experimental Procedures in Example 1, before SDS-PAGE analysis and 
autoradiography. Major radiolabelled bands corresponding to Kidins220 

and PKD are indicated. 

(C) Analysis on sucrose velocity gradients of Kidins220 and PKD 
15 distribution in post-nuclear supematants from PC 12 cells. Fractions (1-18), 

from the top to the bottom, were probed in Western blot for endogenous 
Kidins220 and PKD. The two major peaks (low density, 4-8 and high 
density, 11-15) containing Kidins220 were ftirther fractionated on two 
separated sucrose equilibrium gradients. Fractions were analysed by 
20 Western blot as above. Despite the fact that the majority of PKD is soluble 
in unstimulated cells, a sub-pool of PKD co-migrates with both Kidins220 
peaks. 

(D) PC12 cells homogenate (SF), nuclear pellet (NP), post-nuclear 
supernatant (PNS) and the correspondent soluble (S) and membrane (P) 

25 fractions were prepared as described in the Experimental Procedures of 
Example L Equivalent amounts of membrane pellet (100 ^g) were 
incubated on ice in a buffer containing either 1 M KCl (KCl), or 100 mM 
sodium carbonate (pHl 1). Samples were centrifriged and both the soluble 
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(S) and the pellet (P) fractions were analysed by Western blot with anti- 
Kidins220 antibodies. 

(E) Triton-X 114 partitioning results in the distribution of Kidins220 signal 
in the aqueous (Aq) phase or the insoluble pellet (P), whilst only a niinor 
1 fraction associates with the detergent (D) phase as shown by Western blot 
with anti-Kidins220 antibodies. 



Figure 3. 

Kidins220 is a plasma membrane protein present in lipid rafts. 

10 (A) Immunofluorescence analysis of PC12 cells was performed by using the 
Kidins220 carboxy-terminal polyclonal antibody in the absence (a) or 
presence (b) of the immunising peptide. Specific Kidins220 signal was 
observed both on the plasma membrane (arrowheads) and on an 
intracellular compartment. In non-permeabilising conditions (np, panel c), 

15 no specific staining was observed. Upon NGF-dififerentiation, Kidins220 
immunostaiaing concentrates at the tips of the neurites (panel d, arrowheads 
and inset). Confocal microscopy images corresponding to the projection of 
a series of 0.4iLim sections are shown. Bar: 5 fxm. Immuno electron 
microscopy analysis of Kidins220 localisation in a representative section 

20 from a pellet of prefixed NGF-differentiated PC 12 cells is shown in panel 
e. Kidins220 is distributed predominantly on the plasma membrane of 
neurite extensions. 

(B) PC 12 membranes were incubated on ice for 30 min in a buffer 
containing either 1% Triton-XlOO (TXlOO) or 1% saponin and 1% Triton 

25 XI 00 (saponin). After centrifugation, the solubilised proteins (S) and an 
insoluble pellet (P) were loaded and analysed by Western blot with anti- 
Kidins220 antibodies. 

(C) NGF-diffentiated PC 12 cells were processed for immunofluorescence 
using anti-Kidins220 polyclonal antibodies (grey) and a monoclonal 
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antibody against the GPI-anchored protein Thy-1 (grey). Confocal 
microscopy images corresponding to a single section (0.4 |4.m) of a series 
are shown. The "Merge" panel shows the extent of colocalisation between 
the two proteins (white). This merged image corresponds to a single section 
taken at the level indicated by arrowheads in the top (z-x) and lateral (z-y) 
views. These cross-sections are derived from the complete series of Thy-1 
and Kidins220 images. Scale: 5 ^im. 

Figure 4. 

K]dins220 is a phosphoprotein interacting with PKD in vitro and in 
vivo. 

(A) PDB-treated (+) or untreated (-) PC12 cells were solubilized and total 
lysates were subjected to immunoprecipitation with PKD (PKD ip) or 
Kidins220 (Kidins220) polyclonal antibodies or the corresponding pre- 
immune serum (control ip). The antibodies alone (Ab) were also tested in 
the absence of cell lysates. Immunoprecipitates were separated by SDS- 
PAGE and analysed by Western blot with Kidins220 and PKD antibodies as 
indicated. - 

(B) Immunoprecipitates obtained as described in panel A were assayed in 
an in vitro kinase assay (IVK) (see Experimental Procedures in Example 1). 
Radioactive bands correspondent to Kidins220 and PKD are present in both 
anti-Kidins220 and anti-PKD immunoprecipitates. The asterisk labels tibe 
autophosphorylated PKD band. 

Figure 5» 

PKD phosphorylates directly Kidins220 at a single Serine residue. 
(A) Vector alone (pcDNA3) or PKD mutants (PKD-Wt, PKD-D733A, 
PKD-S744E/S748E) and recombinant Kidins220-VSVG were transfected 
separately in COS-7 cells, stimulated with (+) or without (-) PDB and, after 
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lysis, treated with anti-PKD and anti-VSVG antibodies. PKD and 
Kidins220 beads were then mixed and incubated for 10 min at 30° C in an 
in vitro kinase assay. Samples were analysed by SDS-PAG& and 
autoradiography. PKD-Wt and the constitutively active mutant 
5 S744E/S748E efficiently phosphorylate Kidins220, whilst the dominant 
negative mutant PKD-D733A is ineffective. 

(B) Sequence of the peptides derived from Kidins220 sequence used in the 
in vitro exogenous substrate phosphorylation assays. Ser919 and its 
substitution by Ala919 are in bold. C-term corresponds to the carboxy- 

10 terminal region of Kidins220 (AA 1747-1763). 

(C) PKD-Wt containing immuno-complexes derived from COS-7 
transfected cells treated (+, filled bars) or untreated (-, empty bars) with 100 
vM. PDB were incubated with the different peptides (Syn-2, C-term, Ala 
919 and Ser919) and the level of peptide phosphorylation was measured 

15 (Iglesias et aL^ 1998c). Results are expressed as fold increase over Syn-2 
phosphorylation by PKD under non-stimulated conditions and the values 
are the mean ±S.E. of two independent experiments performed in triplicate. 

(D) PKD-Wt and its mutants PKD-D733A and PKD-S744E/S748E were 
immunoprecipitated from COS-7 cells treated (+, filled bars) or untreated (-, 

20 empty bars) with 100 nM PDB and then incubated with 2.5 mg/ml of the 
Ser919 peptide in an in vitro kinase assay. The level of peptide 
phosphorylation was measured as specified in Experimental Procedures. 
The values shown are the mean ±S.E. of two independent experiments 
performed in triplicate. 

25 (E) Effect of peptide substrate competition on PKD-mediated Kidins220 
phosphorylation. PC12 cells treated (+) or untreated (-) with 100 nM PDB 
were lysed and endogenous Kidins220 and PKD were immunoprecipitated 
with anti-PKD antibodies. The immune-complexes were incubated in the 
presence of 2.5 mg/ml of the different peptides (C-terminal, Ala 919 and 
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Ser919) in an in vitro kinase assay and then analysed by SDS-PAGE and 
autoradiography. Only the Ser919 peptide competes with Kidins220 
phosphorylation. 

Figure 6. 

Degradation of Kidins220 in cells expressing tlie PKD dominant 
negative mutant PKD-D733A. 

(A) PC12 cells transfected with GFP-PKD-Wt (panel a) or the dominant 
negative mutant GFP-PKD-D733A (panel b) were fixed and analysed by 
confocal microscopy. In contrast to the homogeneous distribution of GFP- 
PKD-Wt, the dominant negative mutant GFP-PKD-D733A accmnulates in 
large aggregates (panel b and inset). Confocal microscopy images 
corresponding to the projection of a series of 0.4 ^im sections are shown. 
Panels a and b, bar 5 jim; inset, bar 2.5 \xm. 

(B) PC 12 cells transfected with the dominant negative mutant GFP-PKD- 
D733A (panel a) were fixed and immunostained for endogenous Kidins220 
(grey, panel b). The merged image is shown in panel c, where the white 
indicates colocalization. PKD and Kidins220 are both present in large 
aggregates (arrowheads). Confocal microscopy images correspond to the 
projection of a series of 0.4 jim sections. Representative images of 100 cells 
are shown. Bar 5 |im. 

(C) Vector alone (-) or containing Kidins220-VSGV (Kidins220-VSVG) 
was co-transfected with pcDNA3 (-) or PKD-Wt (Wt) or its mutants PKD- 
D733A (D733A) or PKD-S744E/S748E (S744E/S748E) in COS.7 cells. 
After 72 hours, the level of Kidins220-VSVG and PKD mutants present in 
100 |j.g of total cell pellet was analysed by Westem Blot with anti-VSVG 
antibodies and anti-PKD antibodies, respectively. Co-transfection with the 
dominant negative mutant PKD-D733 causes the disappearance of 



ENSCCCiO: <WC 02207e5A2_L> 



wo 02/20786 PCT/GBOl/03977 

89 

Kidiiis220-VSVG, whilst the constitutively active PKD-S744E/S748E 
determines its accumulation. 



Figure 7. 

5 The complete DNA sequence of the rat Kidins220 cDNA. The open 
reading frame (ORF) corresponds to bases 1 to 5289, which encode amino 
acids 1 to 1763 as shown in Figure lb. 

Figure 8. 

10 Sequence of several splice variants of Kidins220 present in rat 
pheochromocytoma cells. 

Sequence analysis of independent Kidins220 clones highlighted the 
presence of a variable region in the central portion of the molecule. The 

15 identified cassette has a maximum dimension of 101 amino acid residues 
(clone 6) with inserts varying in dimension and sequence. Numbers without 
square brackets in this figure refer to the numbering shown in Figure 16 
(amino acids) and Figure 7 (nucleotide sequence). Numbering shown 
within square brackets in this figure refer to the numbering shown in Figure 

20 18 (amino acids) and Figure 17 (nucleotide sequence). 

Figure 9. 

Kidins220 is distributed in light and heavy plasma membrane fractions 
and co-localises with PKD sub-pools. 

25 (A) Analysis on sucrose velocity gradients of Kidins220 and PKD 
distribution in post-nuclear supematants from PC12 cells. Fractions (1-18), 
from the top to the bottom, were probed in Western blotting for endogenous 

Kidins220 and PKD. The two major peaks (top fractions, 4-8 and lower 
fractions, 11-15) containing Kidins220 were further fractionated in (B) on 
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two separated sucrose equilibrium gradients (top, fractions 4-8; bottom, 
fractions 11-15). Fractions were collected (1-10) and analysed by Western 
blot as above. Despite the fact that the majority of PKD is soluble in 
unstimulated cells, a sub-pool of PKD co-migrates with both Kidins220 
peaks. 

Figure 10. 

An alignment of the two human Kidins220 amino acid sequences which 
are publicly available from GenBank database entries AL 133 620 and 
AB033076. In fee region of overlap where very low sequence identity is 
observed (residues 10006-1031 of AL133620 and residues 506-531 of 
AB033076), the sequence of AB033076 is considered to be correct by 
comparison with the sequence of rat Kidins220. 

Figure 11. 

The full length amino acid sequence of human Kidins220. 
Figure 12. 

Sequence comparison between different Kidins220 orthologues. 

Ratttis noi'\^egicus (Rn), Human (Hs), Drosophila (Dm) and C elegans (Ge) 
Kidins220 sequences were ahgned with Clustal X using default parameters 
(gap open penalty: 10; gap extension penalty: 0.2; protem weight matrix: 
Gonnet). The position of invariable residues is indicated by asterisks, 
residues with similar charge, hydrophobic index or aromatic properties are 
indicated by a double dot, residues with the similar character (hydrophylic 
or hydrophobic) by a single dot. 
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Figure 13, 

The cDNA sequence of human Kidins220 assembled from GenBank 
database entries AL133620 and AB033076. Nucleotides corresponding to 
the translated region are in capitals. Non-translated nucleotides are in lower 
case. 

Figure 14, 

A list of publicly available ESTs which display significant alignment to 

(a) rat and (b) human Kidins220. For both the rat and human sequences, the 
ankyrin repeat regions (shown in Figure lb) were masked during analysis of 
sequence similarities, to exclude ESTs which are simply ankyrin repeat 
containing sequences. 

Figure 15^ 

An alignment of the human (shown as the upper sequence) and rat 
(shown below the human sequence) Kidins220 cDNA sequences. The 
coding region is shown in capitals, and non-coding nucleotides are shown in 
lower case. 

Figure 16, 

The human Kidins220 promoter region located within Genbank 
Accession number AC012495 (human chromosome 2). The sequence 
between positions 34609-36632 denotes the region containing the putative 
transcription start site. Underlined positions 36510 to 36632 correspond to 
the 5' end of Ihe human Kidins220 cDNA sequence that is shown in Figure 
13. The sequence was derived by using the 5' end of the assembled human 
Kidins220 DNA sequence shown in Figure 13 to search the publicly 
available human genome sequence. 
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Figure 17. 

DNA sequence of rat Kidins220. The sequence corresponds to 
Genbank Accession No AF239045, 

5 Figure 18. 

Amino acid sequence of rat Kidins220. The sequence corresponds to 
GenBank Accession No AAG35185. 

Figure 19. 

10 Representation of the domain structure of Kidins220, and the relative 
localisation of various motifs with in the polypeptide: 

aal7 — 391 Ankyrin repeat region circular profile. 
aa707-748 Domain of unknown function DIJP38. 
aal246-1256 Na+/K+ ATPase C-terminus. 
15 aal204-1224 PIPLC X DOMAIN: 

Potential phosphorylated amino acids are indicated by a encircled 'T*', 
DUF38 domains are indicated by an encircled "DUF38". 

Figure 20. 

20 Distribution ofKidins220 in developing mice embryos. 
Figure 21. 

Distribution of Kidins220 in the brain. 

25 Figure 22. 

SAGE analysis of human Kidins220 expression. 
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Figure 23. 

Immunohistochemical analysis of K]dins220 expression in the U87MG 
cell line. 

5 Figure 24. 

Nucleotide sequence comparison of the rat DNA sequence shown in 
Figure 7 with that shown in Figure 17. The sequence firom Figure 7 is 
shown above the sequence from Figure 17. 

10 Figure 25. 

Amino acid sequence comparison of the rat amino acid sequence shown 
in Figure lb with that shown in Figure 18. The sequence from Figure lb 
is shown above the sequence from Figure 18. - 

15 Example 1: Kidins220, a new neuronal PKD substrate, defines a novel 
role for PKD in the regulation of protein turnover. 

Purification and cloning of Kidins2209 a novel PKD substrate. 

PKD is expressed in many tissues, although the brain has one of the highest 
20 mRNA levels (Valverde et aL, 1994). In the search for PKD-specific 
substrates, we selected the pheochromocytoma cell line PC 12 because it has 
been extensively used to investigate multiple signal transduction pathways 
(Marshall, 1995). Additionally, PC 12 cells, (rat cancer cells of 
neuroendocrine origin) represent a well established cellular model to study 
25 neurite outgrowth and regulated secretion and express considerable amounts 
of PKD (Wooten et al, 1997). 

PC 12 lysates were inmiunoprecitated with specific PKD antibodies 
followed by an in vitro kinase assay. Co-immunoprecipitating proteins that 
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appeared to be phosphorylated under stimulating conditions, such as 
phorbol ester treatment, were analysed by SDS-PAGE and autoradiography. 
A band of -220kDa (p220), matching these requirements, was detected 
(Figure lA). To isolate p220, extracts prepared from 5x10^^ PC12 cells 
were subjected to the same steps of immunoprecipitation and in vitj'o kinase 
assay. After SDS-PAGE analysis, the resulting 220 kDa band was excised 
and directly processed for microsequencing. Protein fingerprinting and 
peptide sequence exhibited no significant sequence homology to any known 
protein. However, BLAST (National Center for Biotechnology 
Information, NCBI, Bethesda, MD) searches yielded a variety of human and 
rat expressed sequence tags (ESTs; peptide 1, FLFTDYNR, rat locus 
AA998517; peptide 2, TPSLSSLNSQDSSIEISK, human locus N83544; 
peptide 3, ANQNFDEIEGIR, rat locus AA685317; double underlined in 
Figure IB). 

The majority of p220 cDNA was amphfied by RT-PCR and the remaining 
5'-end was cloned by using 5'-RACE. The full-length clone was 7,140 bp 
and comprised 111 bp upstream of the initiation methionine. This 5' 
untranslated region contained several stop codons in all three reading 
frames and presented the correct sequence context for translational initiation 
(Kozak, 1992). The p220 clone had an extended 3 '-untranslated region of 
1,740 bp after the TGA stop codon. The open reading frame of 5,289 bp 
(Figure 7) encoded a putative protein of 1,763 amino acids (AA) with a 
predicted molecular mass of 196,561 kDa (Figure IB). Because of its PKD 
interaction, we have named this protein Kidins220 (Kinase d interacting 
substrate of 220 kDa). Although human, rat and mouse ESTs corresponding 
to partial sequences of the KidiQs220 cDNA were represented in the 
GenBank/EMBL/DDBJ database, analysis of protein databases using the 
BLAST Enhanced Alignment Utility algorithm failed to identify any 
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significant similarities to known proteins. However, BLAST searches found 
one positive match showing a high degree of homology (23,6% identity; 
34.3% similarity) with a open reading frame present in the Celegans 
cosmid F36H1.2 (GenBank accession number Q20109). BLAST searches 

5 revealed also several hits to an ordered segment of jD. melanoganster 
chromosome 2R. The best hit was within a contiguous 188 Kb DNA region 
(GenBank accession numbers AC007175) from which it is possible to 
predict a protein with the same domain composition as observed in rat and 
C elegans orthologues (Figure IC). Sequence comparison of the rat and C 

10 elegans clones revealed a higher degree of conservation in the amino- 
terminal region (residues 1-1065 presents 30% identity; 43.3% similarity), 
diverging more at the carboxy-temMinal end (Figure IC). During the 
preparation of this manuscript, two sequences corresponding to different 
fragments of the human Kidins220 homologue have been deposited to the 

15 GenBank (AB033076 and AL133620; Nagase et a/., 1999). 

Kidins220 is a multi-domain protein expressed in brain and neuronal 
cells. - 

20 The primary amino acid sequence of Kidins220 can be subdivided into 
several stmctural and fimctional domains (Figure IC). The amino-terminus 
begins with a region of —400 residues comprising 1 1 ankyrin repeats, which 
are likely to be involved in protein-protein interactions (Sedgwick and 
Smerdon, 1999). Downstream of this motif, the sequence contains a 

25 consensus ATP/GTP binding site (P-loop) at residues 467-474 
(AQWGSGKS), followed by four putative transmembrane domains. The 
presence of several membrane spanning domains strongly suggests that 
Kidins220 is an integral membrane protein, possibly localised to the plasma 
membrane (39.1% probability by using PSORT topology prediction 
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system). The similarity with the C elegans and D. melanogaster orthologue 
is interrupted by a large insertion, which contains a prohne-rich region (AA 
984 to 1054), followed by a splicing cassette (AA 1140-1232). This last 
feature suggests that mammalian Kidins220 is the prototype of a family of 
5 membrane proteins constituted by several variants (Figure 8). The splicing 
cassette is followed by a short region in which the homology between the C 
elegans and the rat protein returns (35.9% identity; 53.8% similarity). The 
carboxy-terminal half of the sequence is unrevealing, except for the 
abundance of serine and threonine residues, which could constitute potential 
10 phosphorylation sites for different kinases, including PKCs. Interestingly, a 
unique residue, Ser919, is present in a sequence context favoxirable for the 
specific phosphorylation by PKD (Figure IB and Table I)(Nishikawa et al, 
1997). 

15 TABLE I 



Consensus sequence for PKD phosphorylation 
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Position 
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20 We examined the expression of Kidins220 mRNA in various rat tissues and 
different cell lines. Northern blot analysis with Kidins220 cDNA revealed 
the presence of a -7.6 kb mRNA in brain (Figure ID). A much lower 
hydridisation signal was present in heart and it was xmdetectable in other 
tissues (Figure ID). Interestingly, an additional second messenger of ~5 kb 

25 was also present in brain and heart. Kidins220 mRNA was expressed 
predominantly in PCI 2 cells and to a lesser extent in the neuroblastoma line 
N2A and the glioma line C6 (Figure ID) but was not detectable in the other 
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cell lines tested. Altogether, these results suggest that Kidins220 is 
primarily expressed in tissues of nervous origin. 

Kidins220 is an ATP-binding plasma membrane protein 



To investigate the properties of Kidins220, polyclonal antibodies were 
raised against the last seventeen residues of its carboxy-terminus (AA 1747- 
1763). This antiserum recognised a 220 kDa protein in PC12 lysates (Figure 
2A). Pre-absortion with the immunising peptide completely abolished the 
10 recognition of the 220 kDa band, demonstrating its specificity (Figure 2A). 
The ability of this antibody to immunoprecipitate Kidins220 was also tested 
in PC12 lysates. As shown in Figure 2A, a 220 kDa band was specifically 
recovered, as revealed by subsequent Westem blot with the same Kidins220 
antiserum. 



The presence of a P-loop within the primary sequence of Kidins220 
prompted us to examine if Kidins220 is indeed an ATP-binding protein. The 
ATP binding in Kidins220 inununoprecipitates was tested with 8-azido- 
[^^P]a-ATP (see Experimental procedures). Under these conditions, 

20 Kidins220 is coyalently modified by the ATP analogue, as demonstrated by 
the radioactive band migrating at 220 kDa (Figure 2B). This labelling is 
specific, because it is absent in conditions not allowing the activation of the 
photoactivatable group (Figure 2B, no light) or in the presence of an excess 
of cold ATP. As expected, immunoprecipitation experiments with anti- 

25 PKD antibodies revealed two major radiolabelled bands corresponding to 
Kidins220 and PKD (Figure 2B, arrows). Phorbol ester treatment did not 
change the extent of labelling (data not shown), suggesting that Kidins220 
phosphorylation by PKD is not regulating its ATP binding ability. 



5 



15 
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To study the predicted association of Kidins220 with cellular membranes, 
the postnuclear supernatant from undifferentiated PC 12 cells was loaded on 
a sucrose velocity gradient (Dittie et ah^ 1997) and fractions were analysed 
for specific markers. Kidins220 was absent from the first two fractions at 
5 the top of the gradient (Figure 2C), which are enriched mainly in cellular 
soluble proteins (Dittie et aL, 1997), suggesting that Kidins220 is 
membrane-bound. Kidins220 was distributed along the gradient mainly in 
two peaks (low density, fractions 4-7 and high density, fractions 11-15; 
Figure 2C). Western blot with an antibody against Na"^/K^-ATPase, a typical 

10 marker of the plasma membrane, showed that Kidins220 and Na"*^/K^- 
ATPase overlap in the second peak representing the heavier fractions of the 
gradient (not shown). The presence of a Ughter peak of Kidins220 suggests 
that it is also associated with a low-density lipid-enriched membrane 
population. We examined the distribution of PKD in the same gradient with 

15 anti-PKD antibodies. The vast majority of PKD signal (>95%) appeared in 
the fractions at the top of the gradient, confirming its mainly cj^osolic 
localisation (Figure 2C). However, a low but significant PKD signal was 
also found in the remaining fractions (Figure 2C). The two Kidins220- 
enriched peaks were further fractionated in sucrose equilibrium gradients 

20 and protein distribution was analysed by Western blot. As shown in Figure 
2C, PKD and Kidins220 signals overlapped in both low and high density 
peaks (fractions 4-5 and 6-8, respectively), suggesting that two PKD sub- 
pools colocalise with Kidins220 under basal conditions. 

25 To determine whether Kidins220 is an integral membrane protein, PC12 
cells were fractionated and the post-nuclear supernatant (PNS) was further 
centrifiiged to give a soluble fraction (S) and a membrane pellet (P). The 
220 kDa band was only present in the membrane pellet (Figure 2D). To 
investigate in more detail the association of Kidins220 with membranes, the 
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membrane pellet was incubated with salt or exposed to mild alkaline 



Kidins220 was found associated to the pellet fraction under all conditions 
(Figure 2D), indicating that Kidins220 is an integral membrane protein. To 

5 confirm this observation, phase partition of the membrane pellet with Triton 
X-114 was performed and equal aliquots of the aqueous phase (Aq), 
detergent phase (D) and insoluble pellet (P) were analysed. Strikingly, a 
high percentage of Kidins220 partitioned in the aqueous phase, although 
some of the protein remained in the detergent phase, as well as in the 

10 insoluble pellet (Figure 2E). This is not unprecedented. Giantin, a large 
(-400 kDa) integral component of Golgi membranes (Linstedt and Hauri, 
1993), has been observed to partition in the aqueous phase upon Triton X- 
114 extraction, presimiably due to its extensive hydrophilic surface. The 
large hydrophilic carboxy-terminal half of Kidins220 could be responsible 

15 for this effect. 

Kidins220 concentrates at neurite tips and lipid rafts 

The cellular localisation of Kidins220 was examined by 
20 immunofluorescence in PC 12 cells using anti-Kidins220 antibodies. As 
shown in Figure 3A, Ridins220 distribution revealed an intense 
discontinuous staining at the plasma membrane, brighter in some membrane 
domains, being somehow dispersed through the cell body with a 
considerably high asymmetrical perinuclear labelling (Figure 3A, panel a). 
25 This intracellular and perinuclear staining did not colocalise with the Golgi 
markers GM130 and giantin, with the endoplasmic reticulum markers 
p62/calnexin, or with the intermediate compartment marker ErgicSB (not 
shown). The Kidins220 signal was blocked either by pre-absorption with 
the generating peptide or by omitting the permeabilisation step (Figure 3A, 



conditions, two treatments that remove extrinsic membrane proteins. 
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panels b and c, respectively), confirming, on one hand, the specificity of the 
antibody and, on the other, the intracellular localisation of the carboxy- 
temiinns of Kidins220. This fact and the presence of four transmembrane 
domains support a topology for Kidins220 in which both the carboxy- and 
5 the amino-terminus are intracellular. In nerve growth factor (NGF) 
differentiated PCI 2 cells, Kidins220 accumulated at neurite tips, suggesting 
the possibility of its involvement in neurite extension or growth cone 
organisation and dynamics (Figure 3A, panel d and inset). Immuno electron 
microscopy analysis of Kidins220 distribution in NGF-treated cells 
10 confimied its dual localisation on the plasma membrane of neurites and on 
an unidentified internal membrane-associated compartment (Figure 3A, 
panel e). 



To further analyse KiDin220 cellular distribution, the postnuclear 
15 supernatant from undifferentiated PC 12 cells was loaded on a sucrose 
velocity gradient (Dittie et al, 1997). After cenixifugation, gradient 
fractions were collected and analysed for the presence of specific markers, 
Kidins220 was distributed along the entire gradient mainly in two peaks 
mnning from fraction 4-7 and 11-15 (Figure 9A). Western blotting with an 
20 antibody against Na+/K-f-ATPase, a typical marker of the plasma 
membrane, showed that Kidins220 and Na-l-/K+-ATPase overlap in the 
second peak representing the heavier fractions of the gradient (11-15; not 
shown). The lighter peak of Kidins220 running at the top of the gradient 
suggests that this protein is also specifically present in a low-density 
25 membrane population, possibly constituted by a membrane fraction 
enriched in lipids. Since Kidins220 co-imniunoprecipitated with PKD in 
vivo, we examined the distribution of PKD in the same gradient with anti- 
PKD antibodies. The vast majority of PKD signal (>99%) appeared in the 
first two fractions at the top of the velocity gradient (Figure 9A), which are 
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enriched mainly in cellular soluble proteins (Dittie et al, 1997). However, a 
low but significant PKD signal was also found along all the remaining 
fraction of the gradient (Figure 9A), These results provide evidence that 
only a small portion of the total amount of PKD co-localised with 
5 Kidins220 under these experimental conditions. This conclusion is 
supported by the results obtained by further fi-actionating the two 
Kidins220-enriched peaks in sucrose equilibrium gradients, followed by 
Western blotting with anti-Kidins220 and anti-PKD antibodies. As shown 
in Figure 9B, the two proteins overlap only partially, showing colocalisation 
10 in fractions 4-5 (peak 1) and 6-8 (peak 2). 

Some plasma membrane components, including various 
glycosylphosphatidylinositol (GPI)-anchored proteins, transmembrane 
proteins and signalling molecules, co-isolate in non-ionic detergent-resistant 

15 fractions of the plasma membrane, known as detergent-insoluble glycolipid- 
emiched domain (DIGs) or lipid rafts (Harder and Simons, 1997; Brown 
and London, 1998). To gain insights into the association of Kidins220 with 
specific plasma membrane domains, its solubility in non-ionic detergent 
was analysed. Remarkably, Kidins220 was resistant to extraction by Triton 

20 X-100 at 4**C (Figure 3D). This result, together with the presence of 
Kidins220 in low density fi-actions (Figure 2C), strongly suggests its 
localisation to DIGs. A number of studies have used cholesterol-binding 
agents such as saponin to facilitate DIGs solubility in non-ionic detergents 
(Ilangumaran and Hoessli, 1998). As expected for a protein associated with 

25 lipid rafts, Kidins220 was totally solubilised when saponin and Triton-XlOO 
were combined (Figure 3B), Furthermore, double labelling of NGF- 
differentiated PC12 cells with antibodies against Thy-1, a GPI-anchored 
protein targeted to DIGs (Madore et aL, 1999), and Kidins220 demonstrated 
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that indeed these two proteins extensively colocahse in discrete regions of 
the plasma membrane (Figure 3C). 



In vivo interaction between Kidins220 and PKD. 

5 

To extend the analysis of PKD and Kidins220 interaction, we assayed the 
presence of PKD in Kidins220 immunoprecipitates. As shown in Figure 4A, 
immunoprecipitation of Kidins220 from PCI 2 lysates resulted in the co- 
iramimoprecipitation of PKD. The use of the antibody alone (Ab) or the 

10 pre-immune serum did not result in any recovery of PKD in the sample, 
confirming the specificity of the interaction. Reciprocally, as expected from 
the first in vitro kinase experiment (Figure lA), anti-PKD antibody co- 
immunoprecipitated Kidins220 and their association was not significantly 
altered upon PDB treatment (Figure 4A). The co-immunoprecipitated PKD 

15 was fiilly functional, as tested by in viti'O kinase assay carried out on the 
anti-Kidins220 beads (Figure 4B, asterisk). These results strongly suggest 
that Kidins220 is associated constitutively with a sub-population of PKD 
molecules and PKD/Kidins220 complexes are independent of phorbol esters 
stimulation. 

20 

PKD directly phosphorylates Kidins220 

Preliminary immunoprecipitation and in vitro kinase assays suggested that 
Kidins220 might be a potential substrate for PKD. In these assays, the 220 
25 kDa band was phosphorylated and the level of modification appeared to be 
increased by phorbol esters treatment (Figure lA). Due to its richness in 
serine and threonine residues (9.4% and 6.5%, respectively) and the 
presence of multiple potential phosphorylation sites in the carboxy-terminal 
portion, we tested whether Kidins220 is a phosphoprotein in vivo. 
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Immunoprecipitation of extracts from f P]-labelled PC 12 cells with anti- 
Kidins220 and anti-PKD antibodies revealed a radioactive band of 220 kDa 
corresponding to Kidins220, which strongly suggests that Kidins220 is 
phosphorylated in vivo (not shown). 



In vitro phosphorylation experiments were carried out by using recombinant 
Kidins220 as substrate. For this purpose, expression vectors containing a 
VSVG-tagged version of Kidins220 (Kidins220-VSVG) and PKD (Wt) 
were transfected separately in COS-7 cells and then iramunopurified with 

10 anti-VSVG and anti-PICD antibodies. In the experiment presented in Figure 
5A, immobilised Kidins220-VSVG was incubated with PKD-beads in the 
presence of [^^P] a-AlP, followed by SDS-PAGE and autoradiography. 
Under these conditions, a radioactive band corresponding to Kidins220 was 
detected and its level of phosphorylation increased following in vivo 

15 stimulation of PKD with phorbol ester (Figure 5A). Therefore, Kidins220 is 
specifically recognised and phosphorylated by PKD. To demonstrate 
conclusively that PKD kinase activity was directly responsible for 
Kidins220 phosphorylation, the constitutively active double mutant, PKD- 
S744E/S748E (S744E/S748E), and the dominant negative mutant, PKD- 

20 D733A (D733A), were used in parallel. As shown in Figure 5A, Kidins220 
was phosphorylated only when the active mutant was used and this 
phosphorylation completely disappeared in the presence of the dominant- 
negative PKD mutant. Altogether, these findings demonstrate that 
Kidins220 is phosphorylated in vitro by PKD, and strongly suggest that 

25 Kidins220 is most likely a PKD substrate in vivo. 

To date, specific physiological substrates for PKD have not been identified. 
The only information available is derived from the use of synthetic peptides, 
such as syntide-2 (Valverde et aL, 1994; Van Lint et aL, 1995), which are 
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efficiently phosphorylated in vitro. More recently, a peptide library study 
predicted the optimal substrate sequence for PKD/PKCjx (jLi-peptide) and 
showed the |i-peptide to be extremely specific for this isoform (Nishikawa 
et aL, 1997). Sequence analysis of Kidins220 revealed a consensus motif for 
PKD phosphorylation at position 919 (Ser919; Table I). On this basis, a 
synthetic peptide corresponding to this sequence was used as substrate for 
PKD (Kidins220-Ser919, AA 908-924) and the efficiency of 
phosphorylation compared to that of syntide-2. A mutated Ser to Ala 
peptide (Kidins220-Ala919) and the Kidins220 carboxy-terminal peptide 
(Kidins220 C-Term, AA 1747-1763) served as negative controls (Figure 
5B). The Kidins220-Ser919 peptide was phosphorylated very efficiently by 
PKD-Wt and its phosphorylation was increased following in vivo activation 
of PKD by phorbol esters. The single substitution of Ser to Ala or the use of 
the carboxy-terminal peptide abolished the phosphorylation completely 
(Figure 5C). The level of Kidins220-Ser919 phosphorylation by 
unstimulated PKD was consistently twofold higher when compared to that 
of syntide-2 (Figure 5C), demonstrating that this peptide is a better PKD 
substrate and suggesting that this sequence may be specifically 
phosphorylated by PKD in vivo. To further support a direct phosphorylation 
of Kj[dins220 by PKD and to exclude the involvement in this process of 
other co-immunoprecipitating kinases, the PKD dominant negative mutant, 
PKD-D733A and a constitutively active mutant, PKD-S744E/S748E were 
tested in parallel. As shown in Figure 5D, Kidins220-Ser919 was fully 
phosphorylated in the presence of the constitutively active PKD mutant 
independently of phorbol ester stimulation. In contrast, no phosphorylation 
was observed when the dominant negative form of PKD was used. In 
addition, after co-immunoprecipitation of native PKD and Kidins220 from 
PC 12 cells, the KiDin-Ser919 peptide was able to compete ahnost 
completely (>80%) the phosphorylation of endogenous Kidins220, whereas 
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the alanine mutant version or the carboxy-terminal peptide did not irdhdbit 
this process (Figure 5E). 

PKD is a modulator of Kidms220 trafficldng and turnover 

5 

To provide evidence of die involvement of PKD in Kidins220 cellular 
function(s), we transfected different GFP-PKD mutants into PC 12 cells and 
analysed their effects on endogenous Kidins220 by confocal microscopy. 
GFP-PKD- Wt in NGF-treated PCI 2 cells was uniformly distributed 

10 throughout the cell soma and neurites (Figure 6 A, panel a) and was 
indistinguishable from tlie constitutively active mutant GFP-PKD- 
S744E/S748E (not shown). Strikingly however, the dominant negative 
GFP-PKD-D733A was mainly restricted to large intracellular aggregates ~ 
(Figure 6A, panel b and inset), suggesting that PKD activity is essential for 

15 its physiological localisation in PCI 2 cells. Further analysis of GFP-PKD- 
D733A expressing cells revealed a partial colocalisation with Kidins220 in 
these large aggregates, as shown by the yellow pseudocolour in the merged 
confocal image (Figure 6B, arrowheads). In contrast, Kidins220 
immunostaining in GFP-PKD-Wt or GFP-PKD-S744E/S748E cells was 

20 identical to the pattern observed prior to transfection (not shown), indicating 
that PKD over-expression itself does not alter Kidins220 distribution. 

Unexpectedly, the level of PKD dominant negative expression seemed to 
inversely correlate with Kidins220 signal (Figure 6B, upper cell vs. lower 
25 cell in panel a and b). In order to gather biochemical evidence of the 
possible participation of PKD activity on Kidins220 protein levels, PKD- 
Wt, PKD-D733A and PKD-S744E/S748E were co-transfected with 
Kidins220-VSVG into COS-7 cells and total cell pellets were analysed by 
SDS-PAGE and immunoblotted with anti-PKD and anti-VSVG antibodies. 
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As shown in Figure 6C, Kidins220-VSVG was present in all samples but 
the one over-expressing the dominant negative mutant PKD-D733A. 
Moreover, when the constitutively active mutant PKX)-S744E/S748E was 
present, the signal of Kidins220-VSVG was increased (Figure 6C). The 
levels of expression of the three different PKD forms (PKD-Wt, PKD- 
D733A and PKD-S744E/S748E) were similar, as assessed by Western blot 
with specific anti-PKD antibodies (Figure 6C, PKD). These results strongly 
indicate that the degradation of the Kidins220-VSVG is not a consequence 
of over-expression and/or tagging of the recombinant protein, but reflects a 
functional effect of PKD activity on Kidins220 stability. Inhibition of PKD 
activity, by means of the dominant negative mutant PKD-D733A, leads to 
the specific aggregation and degradation of Kidins220, providing evidence 
that PKD phosphorylation of its protein substrate(s) might be a key 
component in the modulation of their intracellular traffic and turnover. 

EXPERIMENTAL PROCEDURES 
Cell culture and transfection 

Different cell lines: PC12, N2a, C6, U251, COS-7, NIH-3T3, HeLa and 
CHO, were obtained from the ICRF Cell Collection. When required, PCI 2 
cells were treated with NGF (50 ng/ml) for 5 days. PC12 cells were plated 
at 50-60% confluence on glass coverslips and, after 24 hours, transfection 
of 3 [uig of plasmid DNA was performed using the TransFast™ reagent 
(Promega, WI). 48 hours after transfection and NGF treatment (50 ng/ml), 
cells were fixed and processed for immunofluorescence* COS-7 cells were 
transfected using Lipofectin® reagent (Life Technologies, UK) according to 
manufacturer specifications and were used 72 h later. 
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Purification Of Kidins220 

PC12 cells were lysed in RIPA-buffer (25 mM Tris-HCl, pH 7.6, 1% Triton 
X-100, 1% sodium deoxycholate, 0.1% SDS, 150 mM NaCl, 2 mM EDTA, 
2 mM dithiothreitol) with protease inhibitors (1 mM 4-(2-aminoethyl)- 

5 benzenesulfonyl fluoride and 10 |j.g/mJ of aprotinin, leupeptin and pepstatin) 
for 60 min at 4°C. Extracts were iinmunoprecipitated with an anti-PKD- 
PKC|i antibody (D-20, sc-935, Santa Cruz Biotechnology, CA) for 3 hours 
at 4^C. The iiranunoprecipitate was loaded in a preparative 8% SDS-PAGE 
gel and stained with the Zinc-method (Bio-Rad, CA). The protein band was 

10 cut, trypsin digested for 8 hours at 37° C and extracted for 20 minutes with 
50% acetonitrile in 5% trifluoroacetic acid. Peptides were derivatised with 
N-succinimidyl-2-morpholine acetate and de novo sequenced by low-energy 
collision-activated dissociation using a LCQ ion-trap MS (ThermoQuest, 
CA) fitted with a nanoelectrospray source, as previously described (Hoss et 

15 aL, 1999). 

Cloning of rat Kidins220 cDNA. 

To clone most of the Kidins220 cDNA sequence, we = used reverse 
transcriptase PCR (RT-PCR) using as template niRNA obtained firom PC12 

20 cells. Forward and reverse primers were designed based on EST sequences 
and overlapping RT-PCR clones were fused to generate a partial sequence, 
which covered 70% of the final Kidins220 clone. To obtain the full-length 
cDNA, 5 '-RACE reactions were performed with the Marathon™ cDNA 
amplification kit (Clontech, UK). The absence of genomic DNA in the 

25 RNA samples was confirmed by the lack of PCR products when no reverse 
transcriptase was used. PCR products were subcloned in the pGEM-T Easy 
vector (Promega, WI) and at least three clones from each RT-PCR or each 
5 '-RACE reactions were sequenced on both strands. The complete DNA 
sequence of the rat Kidins220 clone has been deposited at GenBank with 
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the accession number AF239045. Several additional Kidins220 clones 
representing alternative splice variants were isolated and will be described 
in Example 2. Analysis of Kidins220 sequence has been performed using 
the following applications (intron-exon boundaries assignment by 
GENSCAN, http://CCR-081.mit.edu/GENSCAN.html; topology prediction 
by PSORT, http://www.psort.nibb.ac.ip: domain composition by Pfam 4.4, 
http://www.sanger.ac.uk/Software/Pfam;) 



RNA preparation and Northern Blot analysis 

10 Total RNA was prepared using TRIzol Reagent (Life Technologies, UK) 
following the manufacturer instructions. Northern blots were performed on 
nylon membranes (Nytran; Schleicher and Schuell, D) according to standard 
protocols. The full length Kidins220 cDNA was labelled and used to screen 
Northern blots containing 2 ^g of poly(A)+ RNA purified from different rat 

15 tissues (Clontech, CA) or lO^g of total RNA from different cell lines. Equal 
loading between different lanes was assessed by methylene blue staining of 
18Sand28SrRNAs. 



Eukaryotic expression 

20 The coding sequence of rat Kidins220 was cloned in a modified version of 
pcDNA3.1/V5-HisA (Invitrogen, CA) obtamed by insertion of the 
oligonucleotide 5'-TCG ACA CCC GGG AAG TTA ACG AGT CTA GAT 
CCA TCG a;tC ACC GGT GT-3'at the Xito I/Age I site. The amino- 
terminal Sma VXba I fragment of Kidins220 was cut from pGEM-T-Easy 

25 and subcloned in the new polylinker. The carboxy-terminal half of 
Kidins220, starting from the unique Xbal site, was reamphfied using a 
reverse oligonuclotide with a point mutation in the stop codon and a Clal 
site (5'-ATC GAT CTT TTC ACT TCT CCT TCC TGA GAG A-3'). This 
PCR product was digested with XbaVClal and sub-cloned in the new vector 
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to obtain Kidins220 complete cDNA followed by a His^-tag motif 
(pcDNA3-Kidins220-HisA). We added a Vesicular Stomatitis Virus 
Glycoprotein (VSVG) epitope sequence by introducing in the ClaVAgel the 
oligonucleotide 5'-CGA TTA CAC CGA TAT AGA GAT GAA GAG GOT 
GGG AAA GTC A-3' (pcDNA3-Kidins220-VSVG-HisA). cDNA 
constructs containing wild-type and mutant PKD sequences and their GFP 
tagging have been previously described (Iglesias et ah, 1998c; Matthews et 
a/., 1999). 

Immunoprecipitation, in vitro Kinase and ATP-binding assays 
Immunoprecipitation was performed with the desired primary antibodies 
and protein-A Sepharose beads for 3 hours at 4°C. Immune complexes were 
washed three times and analysed by SDS-PAGE and Western blot. 
Autophosphorylation and peptides phosphorylation were determined by in 
vitro kinase (IVK) assays as previously described (Iglesias et al,^ 1998c). 
Syntide-2, Kidins220-Ser919 (RQMQRTITRQMSFDLTK, AA 908-924), 
Kidins220-Ala919 (RQMQRTITRQMAFDLTK) and Kidins220-C-terminal 
(CAASSESTGFGEERSIL, AA 1747-1763) peptides were used at a final 
concentration of 2.5 mg/ml. For ATP-binding assays, 8-azido [^^P] a- ATP 
(2|xCi/sample) was added to the immunoprecipitates in the dark and 
subsequently photoactivated for 20 min on ice with a long-wave UV lamp 
(Vilber Lourmat, F) prior SDS-PAGE and autoradiography. 

Subcellular fractionation 

PC12 cellular fractionation was performed as previously described (Dittie et 
al., 1997). Membrane pellets were recovered from post-nuclear supematants 
by spinning at 100,000xg for 30 min. To study Kidins220 membrane 
properties, membrane fractions were resuspended in breaking buffer (250 
mM sucrose, 10 mM Hepes, 1 mM EDTA, 1 mM MgOAc plus protease 
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inhibitors) containing either 1 M KCl or 100 mM sodium carbonatC;, and left 
at 4°C for 30 min prior ultracentrifugation. Triton X-114 partitioning was 
done as described (Bordier, 1981). Kidins220 solubility in 1% Triton X- 
100, with or without 1% saponin, was tested in membrane pellets by 
incubation for 30 min at 4°C. Samples were centrifuged and fractions 
analysed in SDS-PAGE gel and Western blot. For peptide competition, the 
Kidins220 antiserum was pre-incubated with 5 mg/ml of immunising 
peptide for 30 min. 

Immusocytochemistry and electron microscopy 

Cells grown on coverslips were either directly fixed and permeabilized in 
methanol at -20'C for 4 min, or, in the case of Thy- 1, fixed for 4 min in 4% 
paraformaldehyde in PBS at 8X, followed by methanol for 5 min at -20^0. 
Samples were incubated with the corresponding primary antibodies for 1 h 
at room temperature, detected with the suitable fluorophore-conjugated 
IgGs before mounting in Mowiol 4-88 (Harco, UK). The antibodies used in 
these experiments include: rabbit polyclonal antibody against the Kidins220 
CAASSESTGFGEERESIL (AA 1747-1763) peptide, affinity-purified 
rabbit polyclonal antibody to rat GM130," rabbit polyclonal antibody against 
p62; monoclonal anti-Thy-1 (clone OX-7; Bukovsky et aL, 1983); 
monoclonal anti-Ergic53, monoclonal anti-VSVG epitope (P5D4; Soldati 
and Perriard, 1991), Texas Red- and Alexa A488-conjugated secondary 
antibodies were purchased fi-om Molecular Probes (Eugene, OR). Raw 
images acquired by confocal microscopy (Zeiss LSM510, D) with a x63 
Plan-Apochromatic oil objective were normalised for each colour 
separately. Confocal images presented are two dimensional projections of a 
z-series through the cell depth, except in some instances specified in figure 
legends, in which single sections have been used. Images were processed 
for presentation with Adobe Photoshop 5.0 (Adobe Systems, CA). Standard 
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electron microscopic techniques for fixation, sectioning and staining or 
immnnodecoration with gold-conjugated antibodies of the samples were 
used (Nakamura et al.^ 1995). 

5 DISCUSSION 

Kidins220, a new plasma membrane protein enriched in lipid rafts and 
neurites tips, associates with PKD 

We report here the isolation, molecular cloning and characterisation of a 
10 novel protein interacting with PKD, which presents orthologues in D. 
melanogasiei\ C elegans and mammals. Several lines of evidence point to a 
specific functional interaction between PKD and Kidins220: (1) PKD and 
Kidins220 interact in vitro and in vivo; (2) Kidins220 is directly 
phosphorylated by PKD, being the only additional phosphorylated protein 
15 in PKD immunoprecipitates (3) Kidins220 contains a consensus motif for 
PKD-specific phosphorylation and the peptide corresponding to this 
sequence is strongly phosphorylated by PKD (4) Kidins220 is degraded in 
vivo when PKD activity is abolished by the use of a dominant negative 
PKD mutant 

20 

The cloning of Kidins220 cDNA reveals no overall similarity to any 
previously characterised gene products. Kidins220 is an ATP-binding 
plasma membrane protein highly expressed in brain and PC 12 cells, where 
it accumulates at the neurite tips. Its primary sequence lacks catalytic 
25 domains, but it contains several motifs involved in protein-protein 
interaction. These include eleven ankyrin repeats, a proline rich region and a 
carboxy-terminal portion with a high content in serine and threonine 
residues. Ankyrin repeats have been found to promote cytoskeletal 
localisation, membrane cj^oskeleton bridge formation or protein-protein 
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interaction (Sedgwick and Smerdon, 1999). Although their role in 
Kidins220 is presently unknown, these repeats could couple Kidins220 to 
cytoskeletal components. The association of Kidins220 with the insoluble 
pellet after Triton X-1 14 partitioning supports this hypothesis. Interestingly, 
5 PKD colocalises with cortactin, an actim binding protein, in invadopodia of 
epithelial cells (Bowden et al, 1999). Cortactin is enriched in growth cones 
in neurones (Du et al, 1998), where actin cytoskeleton remodelling takes 
place (Forscher and Smith, 1988). These findings, together with Kidins220 
locaHsation at the tips of the neurites, suggest a function of PKD/Kidins220 

10 complexes in cytoskeletal reorganisation during neurite extension.. In 
addition to the ankyrin-repeats, Kidins220 has a proline-rich region that 
may serve as binding site for adaptor modules such as SH3 and WW 
domains (Nguyen et al, 1998). Finally, Kidins220 carboxy-terminal half is 
very abundant in serine and threonine (Ser/Thr) residues. This region could 

15 participate in interactions with Ser/Thr binding modules (Yaffe and Cantley, 
1999). 

The association of Kidins220 with the plasma membrane was predicted by 
the presence of four putative transmembrane domains in its sequence. This 

20 has been verified in cell fi-actionation and immunolocalisation experiments. 
The plasma membrane distribution of Kidins220 is uneven, presenting 
higher density in restricted membrane domains. By using two different 
approaches we have demonstrated the association of Kidins220 to DIGs. 
Kidins220 is solubilised in non-ionic detergents only when combined witih 

25 the cholesterol-sequestering agent saponin, indicating that it associates to a 
cholesterol-rich microenvironment. Double immunolabelling of NGF- 
treated PC12 cells with Kidins220 and Thy-1, a lipid rafts marker (Madore 
et al, 1999), showed specific overlapping of Kidins220 and Thy-1 
immunofluorescence. 
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What is the physiological importance of Kidins220 association with DIGs? 
DIGs have been proposed to function as platforms for the assembly of 
cytoplasmic and membranous signalling complexes and for polarised 
5 membrane trafficking (Brown and London, 1998). Other PKD-interacting 
proteins, such as BCR, Syk and PLCyl (Sidorenko et al, 1996; Cheng et 
al, 1999) and lipids involved in signal transduction, such as 
phosphatidylinositol-(4,5)-bisphosphate (PtdIns(4,5)P2), have been localised 
to rafts as well (Hope and Pike, 1996). The selective recruitment of 
10 signalling molecules to the plasma membrane is an essential prerequisite for 
the functioning of various signaUing cascades. This is, indeed, the case for 
PKD, which translocates from the cytosol to the plasma membrane after 
PDB stimulation (Matthews et al, 1999). In PC 12 cells, GFP-PKD migrates 
to the plasma membrane after phorbol ester stimulation, where Kidins220 is 

15 locaUsed (not shown). We suggest a physiological model in which the 
hydrolysis of DIGs-localised PtdIns(4,5)P2 by PLCs after receptor 
stimulation will increase the DAG levels in this lipid microdomains, 
triggering the recruitment of activated PKD to these areas. Kidins220 would 
therefore serve as a scaffold for the assembly of a signalling complex at 

20 lipid rafts, one of the members being PKD. 

How, where and when does PKD interact and phosphorylate 
Kidins220? 

25 Kidins220 and PKD form a stable complex that is not affected by 
stimulating conditions. Other proteins whose PKD interaction is 
independent of stimulation have been reported (Sidorenko et al., 1996; 
Nishikawa et al, 1998; Johannes et al, 1999; Waldron et al, 1999). 14.3.3 
proteins are the only example in which phorbol ester stimulation enhances 
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their association, although they are not a substrate for PKD (Hausser et al.^ 
1999). 

PKD is mainly cytosolic in resting PC 12 cells. However, it also associates 
5 with membrane fractions, including lipid rafts, which contain Kidins220. 
The shift of PKD to the plasma membrane after stimulation (Matthews et 
aL, 1999) would increase its transient contact witii Kidins220, favouring 
ftirther phosphorylation of Kidins220 by translocated active PKD 
molecules. Other authors have found PKD in complexes at the plasma 

10 membrane (Sidorenko et aL, 1996; Bowden et aLy 1999). In contrast to all 
these studies, PKD has been reported as a Golgi-localised kinase (Prestle et 
aL, 1996; Jamora et aL, 1999). Several PKC isozymes have different 
localisations depending on the cell type (Goodnight et al., 1995) or undergo 
a dynamic compartmentalisation (Hug and Sarre, 1993). The 

15 immunolocalisation of PKD to different compartments may therefore be 
consistent with a model in which PKD plays a variety of roles depending on 
intracellular localisation, cell type and/or basal metabolic activity. 

A critical issue in tihe understanding of the role of PKD is the identification 
20 of fimctionally relevant substrates of this kinase. Data from in vitro kinase 
assays in PC 12 cells not only confirm the PKD/Kidins220 interaction, but 
also show that Kidins220 is phosphorylated by this kinase and that 
phosphorylation is enhanced under stimulation conditions. Very little is 
known about the substrate specificity of PKD, considering that there were 
25 not as yet known PKD specific substrates. From previous reports (Van Lint 
et aL, 1995; Dieterich et aL, 1996; Nishikawa et aL, 1997), PKD appears to 
prefer substrates with basic residues upstream of the phosphorylatable 
serine, hydrophobic amino acids (F,L,V) at the +1 position and a critical 
Leu residue at the -5 position. Kidins220 contains a consensus motif for 



BNSDCXJID: <WO 02207B6A2_L> 



^ WO.02/20786 



PCT/GBOl/03977 



115 

PKD phosphorylation at Ser919 (Table I). Results from phosphorylation 
assays with the synthetic peptide corresponding to this region strongly 
suggest that this sequence is a target for PKD-Kidins220 phosphorylation in 
vivo. Although further studies to map the phosphorylated residues are 
5 needed, the data provided here demonstrate that Kidins220 is the first 
identified physiological substrate for PKD. This conclusion is iurther 
supported by the effect of PKD activity on the stability of ICidins220, as 
discussed below. 

10 PKD regulates Kidins220 tra£Bcking and turnover 

Kidins220 localisation and degradation is modulated by PKD activity, 
suggesting that PKD participates in the regulation of its substrate's 
availability. Kinases regulate different substrates by conferring specific 
phosphorylation patterns that affects conformation, stability, transport and 

15 cellular localisation. The lability of many regulatory proteins is cmcially 
important for Aeir function in the cells. In the case of signalling molecules 
or plasma membrane receptors, it is apparent that changes in their half-life 
could have significant impact on the activity of the corresponding signalling 
cascade. Here we show that PKD activity is a critical determinant of 

20 Kidins220 protein levels and cellular localisation, an indication for their in 
vivo connected function. 

Aspects that remain to be clarified are the molecular mechanisms by which 
PKD controls Kidins220 levels and localisation. At least two mechanisms 
25 could explain how phosphorylation and dephosphorylation of 
PKD/Kidins220 complexes might affect Kidins220 stability. The first 
involves a direct effect of PKD on the pathway controlling the intracellular 
transport of Kidins220, In this model, the inactivation of PKD activity 
would impair Kidins220 phosphorylation, which may be required to 
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maintain Kidins220 in a functional conformation and/or be essraitial for the 
targeting to its final cellular compartment. The final result would be 
Kidins220 mislocalisation and consequent routing to degradation. The fact 
that PKD activity has been described as important for protein transport 
5 along the secretory pathway (Prestle et ah, 1996; Jamora et al., 1999), 
strongly supports this being the case, A prediction of this model is that PKD 
dominant negative mutant would cause the accumulation of Kidins220 in 
large aggregates, as indeed shown by our experiments. 

10 Another possible mechanism envisages that Kidins220 dephosphorylation at 
the plasma membrane induces intemaKsation and down regulation by a 
mechanism either lysosomal or proteasome-mediated. Ubiquitination has 
been implicated as a primary determinant in the control of protein stability. 
In yeast, ubiquitination participates as an intemalisation signal (Hicke, 

15 1997). In mammals, the role of phosphorylation in triggering protein 
degradation linked to the ubiquitin pathway has been extensively described 
(Traenckner et aL, 1994; Galcheva-Garlova et aL, 1995; Mori et al., 1995; 
Strous et a/., 1996; Lu et al., 1998). However, few examples in which 
phosphorylation is linked to protein stabilisation are available. This is the 

20 case for c-Jun, in which Jun N-terminal kinase (JNK) targets the 
ubiquitination of non-phosphorylated c-Jun (Fuchs et al., 1996). Similar to 
JNK, PKD is associated to and phosphorylates Kidins220, suggesting a 
model in which ubiquitination cause Kidins220 disappearance. This 
hypothesis correlates well with the presence of a conserved motif for 

25 ubiquitination (Orford et al, 1997) within the Kidins220 sequence at 
residues 1481-1490 (SKLLPGKKSS) and residues 1637-1445 
(SICSEDKKS) (as numbered in Figure lb). Alternatively, PKD-dependent 
phosphorylation of Kidins220 could be affecting both secretory and 
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intemalisation pathways, leading to a complex model, which entwines the 
two mechanisms. 



This is the first report presenting evidence for PKD function in protein 
5 tumover. Recently, PKD has been implicated in the control of key cellular 
events such as lymphocyte activation (Sidorenko et aL, 1996), metastasis 
(Bowden et aL, 1999), tumour necrosis factor-induced apoptosis (Johannes 
et ah, 1998), growth factor receptor stimulation (Bagowski et al, 1999), 
Golgi organisation and protein transport (Prestle et aL, 1996; Jamora et al, 

10 1999). By introducing this novel PKD role, we propose a unifying 
mechanism by which PKD may control several cellular functions by 
altering the traffic and degradation of its interacting substrates. Kidins220, 
whose cloning and first characterisation is reported herein, is the first PKD- 
associated protein substrate whose localisation and degradation are affected 

15 by PKD activity. The development of specific inhibitors for PKD could 
therefore be important for the selective degradation of protein substrates 
that might play a central role in pathogenic processes, like 
neurodegenerative diseases or cancer. 

20 Example 2: Splice variants of the kidins220 protein 

The variants were obtained by direct PGR of cDNA obtained by reverse 
transcription of total nxRNA extracted firom PC12 cells. The sequences 
were verified by re-sequencing both strands twice, and the sequences which 
25 differ firom the sequence shown in Figure 10 are shown in Figure 8. 

PGR products were subcloned in the pGEM-T Easy vector (Promega, 
Madison, WI) and sequenced by using an ABI Prism 377 DNA sequencer 
and BigDye Temiinator technology (Apphed Biosystem, Warrington, UK). 
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Multiple clones from each RT-PCR or each 5 '-RACE reactions (> 3) were 
sequenced in both DNA strands. Variability anaong different clones was 



variability has been interpreted as possible presence of multiple Kidins220 
5 splicing variants. 

Note: variations in the central portion has been verified in full length 
Kidins220 and are independent from the primer used in RT-PCR. Total 
RNA was prepared following manufacturer instruction by using the TRIzol 
10 reagent (GIBCO). 

Example 3: Production of antibody specific for Kidins220 
phosphorylated at Serine 919 

15 An antibody able to specifically bind to phosphorylated rat Kidins220 but 
not to unphosphorylated rat Kidins220 was generated by immunising a 
rabbit with a peptide corresponding to the sequence 908-924 of rat 
Kidins220 coupled to KLH (Pierce) via an additional cysteine at the N- 
terminus. The antibody is polyclonal and recognises the phosphopeptide 

20 but not the imphosphoiylated one as determined by ELISA with 
immobilised peptide. The entire signal is abolished accordingly by 
incubation with the phosphorylated but not the unphosphorylated peptide. 
Affinity purification of the antibody was achieved on a Sulfolink matrix 
(Pierce) pre-derivatised with the phosphorylated peptide. Elution was with 

25 Glycine 200 mM pH 2.5. 



consistently reported in the central portion of Kidins220 cDNA. This 
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Example 4: Structural information of Kidins 220 
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aal7 - 391 Ankyrin repeat region circular profile. 

Ankyrin repeats (ANK) are tandemly repeated modules of about 33 amino 
5 acids. They occur in a large number of functionally diverse proteins mainly 
from eukaryotes. Many ankyrin repeat regions are known to function as 
protein-protein interaction domains. The conserved fold of the ankyrin 
repeat unit is known from several crystal and solution structures, e.g. from: 
- p53-binding protein 53BP2, Cyclin-dependent kinase inhibitor pl9Ink4d, 
10 NF-kappaB inhibitory protein IkB-alpha. It has been described as an L- 
shaped structure consisting of a beta-hairpin and two alpha-heUces. 

aa707-748 Domain of unknown function DUF38. 

This domain has no known function. It is found in many proteins from C. 

15 elegans. The domain is found associated with the F-box. The F-box domain 
was first described as a sequence motif found in cyclin-F that interacts with 
the protein SKPl. This motif is present in numerous proteins and serves as a 
link between a target protein and an ubiquitin-conjugating ^enzyme. The 
SCF complex (e.g., Skpl-Cullin-F-box) plays a similar role as an E3 ligase 

20 in the ubiqmtin protein degradation pathway. 

aal246-1256 Na+/K4- ATPase C-terminus. 

This extension is specific to the Na+/K+ ATPase subfamily of ATPases. 
The alpha chains of sodium/potassium-transporting ATPases (H+/K+ and 
25 Na+ZK+'ATPase) catalyze the hydrolysis of ATP, coupled with the 
exchange of sodium and potassium ions across the plasma membrane. The 
proteins are located in the cell membrane, the ion transport they mediate 
creating the electro-chemical gradient that provides the energy for the active 
tranq>ort of various nutrients. 
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aal204-1224 PIPLC X DOMAIN: 



Phosphatidylinositol-specific 



phospholipase 



X-box 



domain. 



10 



15 



20 



Phosphatidylinositol-specific phospholipase C, an eukaryotic intracellular 
enzyme, plays an important role in signal transduction processes. It 
catalyzes the hydrolysis of l-phosphatidyl-D-myo-inositol-3,4,5 
triphosphate into the second messenger molecules diacylglycerol and 
inositol- 1,4,5-triphosphate. All eukaryotic PI-PLCs contain two regions of 
homology, sometimes referred to as 'X-hox' and 'Y-box'. The order of these 
two regions is always the same (NH2-X-Y-COOH), but the spacing is 
variable. In most isoforms, the distance between these two regions is only 
50-100 residues but in the gamma isoforms one PH domain, two SH2 
domains, and one SH3 domain are inserted between the two PLC-specific 
domains. The two conserved regions have been shown to be important for 
the catalytic activity. 

aall85-1211 Ly-6 / u-PARA 

A variety of GPI-linked cell-surface glycoproteins are composed of one or 
more copies of a conserved domain of about 100 amino-acid residues. 
These proteins are: (i) Urokinase plasminogen activator surface receptor u- 
PAR, which binds to u-PA and is responsible for its proteolysis- 
independent signal transduction activation effects, (ii) The mouse family of 
Ly-6 T-cell antigens (Ly-6A, -6C, -6E, -6F and -6G) and (iii) CD59 
(MACIF or MIRL), an inhibitor of the complement membrane attack 
complex action. 
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Example 5: Distribution of Kidins220 in developing mice embryos. 

A polyclonal antibody specific for Kidins220 (GSC16) which was raised to 
the peptide CAASSESTGFGEERSIL (amino acids 1747 to 1763 of the 
5 sequence shown in Figure lb) was used to investigate the distribution of 
Kidins220 in developing mice embryos at stage 14, 14 days post coitum. 
Kidins220 is shown to be expressed in all brain but is particularly 
concentrated in the tissue underlying the ventricular vesicle and in a layer of 
migrating neurons which will form the cortex (Figure 20). Kidins220 is 
10 present also in the Choroid Plexus, which is differentiating in the pavement 
of the Fourth ventricle. 



15 



20 



25 



Example 6; Distribution of Kidins220 in the brain. 

Labelling of Kidins220 shows that additional regions of the brain that 
contain high levels of Kidins220 are the dorsal root ganglia, the trigeminal 
ganglion and the differentiating neuroepithelium of the inner ear (cochlear 
canal) (Figure 21). Other structures that appear labelled are the primordia of 
the tactive and sinus air follicle and the developing external ear. 

Example 7: SAGE analysis of human Kidins220 expression. 

The human Kidins220 nucleotide sequence was analysed using the 
SAGEmap vNorthem tool provided by the Cancer Genome Anatomy 
^^^'J^''* (http://cgap.nci.nih.gov/). The sequence is shown to correspond to 
the Unigene cluster Hs.9873 with a corresponding SAGE tag 
"CTAAAAAAAA". In silico analysis of the relative number of tags 
encountered in various libraries (which number is an indication of the 
relative levels of expression of the gene in the corresponding tissue; 
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www.sagenet.org) reveals that Kidins220 is over-expressed in a variety of 
tumours, especially in glioblastoma multiforme and prostate cancers (Figure 
22). 

5 Example 8: Nuclear localisation of Kidin220 in the astrocytoma- 
glioblastoma U87MG cell line. 

Analysis of Kj[dins220 expression in a human cell line derived from an 
astrocytoma/glioblastoma (U87MG, ICRF cell depository) confirmed the 

10 high level of expression of this protein in this type of neoplasm of the CNS. 
This high level of expression is accompanied by the shift of Kidins220 
localisation from the plasma membrane to the cell nucleus (Figure 23). 
Although the molecular mechanism responsible for this differential 
trafficking remains unknown, the nuclear localisation of Ridins220 could 

15 represent a diagnostic sign to follow neoplastic progression and to 
distinguish neoplastic glia-derived cells from neighbouring neurons. 
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CLAIMS 

1. A polynucleotide comprising a nucleotide sequence which encodes 
the polypeptide whose amino acid sequence is given in Figure lb or 
i Figure 11 or a fragment or variant or fusion thereof, or a fusion of 

said fragment or variant, or a complement thereof, provided that the 
polynucleotide is not any one of the clones corresponding to the 
GenBank Accession Nos AL133620. AB033076 AC007175, F36H1, 
or any Accession No. given in Figure 14a or 14b. 

A polynucleotide according to Claim 1 comprising a nucleotide 
sequence encoding a polypeptide whose amino acid sequence is 
given in Figure 8. 

15 3. A polynucleotide according to Claim 1 comprising the nucleic acid 
sequence given in Figure 7. 

4. A polynucleotide according to Claim 1 comprising a nucleic acid 
sequence given in Figure 8. 

20 

5. A polynucleotide according to Claim 1 which encodes a polypeptide 
with at least 65% identity to the amino acid sequence given in Figure 
lb or Figure 11. 

25 6. A polynucleotide according to Qaim 5 which encodes a polypeptide 
with at least 85% identity to the amino acid sequence given in Figure 
lb or Figure 11. 
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7. A polynucleotide according to Claim 5 which encodes a polypeptide 
with at with at least 95% identity to the amino acid sequence given in 
Figure lb or Figure 11. 

5 8. A polynucleotide according to any one of Claims 1 to 7 which 
encodes any one of the peptides CAASSESTGFGEERSIL, 
RQMQRTITRQMSFDLTK or RQMQRTITRQMAFDLTK or a part 
thereof which is no less than 10 contiguous amino acids. 

10 9. A polynucleotide according to any one of flie preceding claims which 
is at least 30 nucleotides in length. 

10. A polynucleotide according to Claims 1 to 9 which contains no 
introns. 

15 

11. A polynucleotide which is no more than 250bp and which is able to 
selectively hybridise to any one of the polynucleotides as defined in 
Claims 1 to 10. 

20 12. A polynucleotide according to Claim 11 which is between 10 and 
200 nucleotides in length. 

13. A polynucleotide according to any one of the preceding claims 
further comprising a detectable label. 



25 



14. An expression vector comprising a polynucleotide according to any 
one of Claims 1 to 10. 
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15. An expression vector according to Claim 14 which is capable of 
expressing a polypeptide whose amino acid sequence is shown in 
Figure lb or Figure 1 1, or a jfragment or variant or fusion thereof or a 
fusion of said fragment or variant. 

16. A replicable vector comprising a polynucleotide as defined in any 
one of Claims 1 to 10. 

17. A polynucleotide comprising the sequence shown in Figure 16 or a 
fragment or variant thereof provided that it is not the clone 
corresponding to GenBank Accession No AC012495. 

18. A polynucleotide according to Claim 17 which is capable of 
promoting transcription. 

19. A polynucleotide according to Claim 18 wherein the transcription is 
neuron specific. 

20. A polynucleotide according to any one of Claims 17 to 19 operably 
linked to a polynucleotide encoding a polypeptide. 

21. A polynucleotide according to Claim 20 wherein the polypeptide is a 
tiierapeutic polypeptide. 

22. A polynucleotide according to Claim 20 wherein the polynucleotide 
encoding a polypeptide is a polynucleotide according to any one of 
Claims 1 to 10, or a polypeptide capable of expressing a polypeptide 
whose amino acid sequence is given in Figure lb or Figure 1 1 or a 
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fragment or variant or fusion thereof, or a fusion of said fragment or 
variant, 

23. A polynucleotide according to Claim 20 wherein the polypeptide is a 
5 detectable polypeptide. 

24. A polynucleotide according to Claim 20 wherein the polypeptide is 
Protein Kinase D or a functional mutant thereof. 

10 25. A nucleic acid vector comprising a polynucleotide as defined in any 
one of Claims 17-24. 



26. An expression vector according to either of Claims 14 or 15 or a 
polynucleotide according to any one of Claims 17 to 25 for use in 
15 therapy. 



27. A recombinant host cell containing a polynucleotide according to any 
one of Claims 1 to 10 or Claims 17-24 or a replicable vector or an 
expression vector according to any one of Claims 14 to 16 or Claim 

20 25. 

28. A host cell according to Claim 27 wherein the host cell further 
comprises a polynucleotide capable of expressing Protein Kinase D 
or a functional equivalent thereof. 

25 

29. A recombinant host cell according to Claims 26 to 28 where the host 
cell is a bacterial cell. 
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30. A recombinant host cell according to Claims 26 to 28 where the host 
cell is a mammalian cell. 

31 . A recombinant host cell according to Claims 26 to 28 where the host 
cell is an insect cell. 

32. Use of a polynucleotide according to any one of Claims 1 to 10 and 
14 to 16 in tiie production of the encoded polypeptide. 

33. A polypeptide comprising the amino acid sequence given in Figure 
lb or Figure 1 lor a fragment or variant or fusion thereof or a fusion 
of said fragment or variant provided that said polypeptide is not tiie 
polypeptide encoded by GenBank Accession No AB033076. 

34. A polypeptide according to Claim 33 which has an amino acid 
sequence with at least 65% identity to the amino acid sequence of 
Figure lb or Figure 1 1 . 

35. A polypeptide according to Claim 33 which has an amino acid 
sequence with at least 85% identity to the amino acid sequence of 
Figure lb or Figure 11. 

36. A polypeptide according to Claim 33 which has an amino acid 
sequence with at least 95% identity to the amino acid sequence of 
Figure lb or Figure 11. 

37. A polypeptide according to Claim 33 comprising an amino acid 
sequence given in Figure lb wherein the amino acids from residues 
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1140 to 1232 are replaced with Uie amino acids whose sequence is 
shown between the square brackets in Figure 8. 

A polypeptide according to any one of Claims 33 to 37 wherein the 
serine located at position 919 as numbered in Figure lb, or an 
equivalent position, is phosphoiylated. 

A polypeptide according to any one of Claims 33 to 37 wherein the 
serine located at position 919 as numbered in Figure lb, or an 
equivalent position, is altered to an amino acid other than serine. 

A polypeptide according to any one of Claims 33 to 37 wherein the 
tjo-osine residue located at any one of positions 409 or 1453 as 
numbered in Figure lb, or an equivalent position, is altered to an 
amino acid other than tyrosine. 

A polypeptide according to any one of Claims 33 to 37 wherein any 
one of the lysine residues located at any one of the positions 473, 
1482, 1487, 1488, 1643 or 1644 as numbered in Figure lb, or an 
equivalent position, is altered to an amino acid other than lysine. 

A polypeptide comprising the sequence (I/L)XRQM(S/A)J wherein 
X is any amino acid, J is a hydrophobic amino acid, (I/L) represents 
isoleucine or leucine and (S/A) represents serine or alanine and 
which is not the polypeptide encoded by the nucleotide sequence in 
GenBank Accession No AB033076 and which is not \x peptide 
(AALVRQMSVAFFFK). 
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43. A polypeptide comprising the amino acid sequence 
CAASSESTGFGEERSIL or a variant or fragment or fusion thereof 
or a fusion of said fragment or variant or an analogue thereof 
provided that it is not the polypeptide encoded by the nucleotide 
5 sequence in GeuBank Accession No AB033076. 



44. A polynucleotide which encodes a polypeptide according to Claims 
33 to 43. 

10 45. A method of producing a peptide or polypeptide according to any 
one of Claims 33 to 43 comprising expressing a polynucleotide of 
any one of Claims 1 to 10 which encodes said peptide or polypeptide. 

46. A method of producing a peptide or polypeptide according to any 
15 one of Claims 33 to 43 comprising culturing a host cell according to 

any one of Claims 27 to 31. 

47. A method according to Claim 45 wherein the polynucleotide is 
translated in an in vitro translation system. 

20 

48. A method according to Claims 45 to 47 wherein the method 
comprises the further step of isolating said polypeptide. 

49. A method according to Claim 48 wherein the isolation step employs 
25 an antibody which selectively binds the expressed polypeptide. 

50. An antibody which specifically binds to a polypeptide whose amino 
acid sequence is given in Figure lb or Figure 11, or a fragment or 
variant of the said polypeptide. 
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An antibody according to Claim 50 which does not bind to a variant 
wherein the amino acids from residues 1140 to 1232 in Figure lb are 
replaced with the amino acid whose sequence is shown between the 
square brackets in Figure 8. 

An antibody according to Claim 50 which specifically binds to a 
variant wherein the amino acids from residues 1140 to 1232 in 
Figure lb are replaced with the amino acid whose sequence is shoAvn 
between the square brackets in Figure 8 and which antibody does not 
bind to a polypeptide whose amino acid sequence is given in Figure 
lb. 

An antibody which specifically binds to a polypeptide comprising 
any of the amino acid sequences TEDAAEGLPS, CHRQLTVTEF, 
RQVQKLQAAV, DQNNGLAAVP, HPFYNRAAVP, 

HPFYNRANIN and FFAPYL YTPR. 

An antibody according to Claim 50 whose binding can distinguish 
between one or a selection made from the polypeptide whose amino 
acid sequence is given in Figure lb and the polypeptide variants 
wherein the amino acids from residues 1 140 to 1232 in Figure lb are 
replaced with the amino acid whose sequence is shown between the 
square brackets in Figure 8. 

An antibody according to Claim 50 which specifically binds to the 
amino acid sequence CAASSESTGFGEERSIL. 
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56. An antibody according to Claim 50 which specifically binds to a 
polypeptide whose amino acid sequence comprises at least five 
consecutive residues including residue 919 of the polypeptide as 
numbered in Figure lb or an equivalent position of tiie polypeptide 
of Figure 11. 

57. An antibody according to Claim 56 which specifically binds to the 
amino acid sequence RQMQRTITRQMSFDLTK. 

58- An antibody according to any one of Claims 50, 51, 56 and 57 
wherein the serine at residue 919 as numbered in Figure lb, or an 
equivalent residue in Figure 1 1, is phosphorylated. 

59. An antibody according to any one of Claims 50, 51, 56 and 57 
wherein the serine at residue 919 as numbered in Figure lb, or an 
equivalent residue in Figure 11, is non-phosphorylated. 

60. An antibody as defined in any one of Claims 50 to 59 which is a 
monoclonal antibody. 

61. An antibody according to any one of Claims 50 to 59 fiirther 
comprising a detectable label. 

62. A method of producing an antibody according to any one of Claims 



63. A mefliod of modulating the activity of a polypeptide according to 
any one of Claims 33 to 42 the method comprising contacting the 



50 to 59. 
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polypeptide with an antibody according to any one of Claims 50 to 
61, 

64. A method according to Claim 63 wherein addition of an antibody 
5 according to any one of Claims 50, 51, 56, 57 and 59 prevents a 

region of the polypeptide from interacting with another protein. 

65. A method according to Claim 63 wherein addition of an antibody 
according to any one of Claims 50, 51, 56, 57 and 59 prevents 

10 phosphorylation of serine 919 of the polypeptide as numbered in 

Figure lb or an equivalent residue. 

66. A method of modulating the function of a polypeptide according to 
any one of Claims 33 to 37 when in the presence of Protein Kinase 

15 D, the method comprising inhibiting the phosphorylating ability of 

Protein Kinase D. 

67. A method of modulating the function of the polypeptide according to 
any one of Claims 33 to 42 comprising reducing the ability of said 

20 polypeptide to bind ATP . 

68- A method of identifying an agent which modulates the promoter 
activity of the polynucleotide according to any one of Claims 17-24, 
said method comprising 

25 

i) introducing said polynucleotide into a suitable cell; 

ii) exposing a test agent to said cell; 

iii) detecting whether (and, optionally, to what extent) the 
promoter activity of the polynucleotide has been altered. 
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69. A method of identifying an agent which modulates the activity of the 
polypeptide according to Claims 33 to 42, said method comprising: 

i) exposing a test agent to said polypeptide and; 

5 ii) detecting whether (and, optionally, to what extent) the activity 

of said polj^eptide has been altered. 

70. A method of identifying an agent which modulates the activity of 
Protein Kinase D, the method comprising: 

10 i) exposing Protein Kinase D or a functional equivalent thereof 

to a peptide or polypeptide comprising the amino acid sequence 
(I/L)XRQMSJ, where X is any amino acid, J represents a 
hydrophobic residue and (I/L) represents isoleucine or leucine or 
to a peptide or polypeptide comprising at least five consecutive 

15 residues including residue 919 of the polypeptide as numbered in 

Figure lb; and 

ii) detecting, in the presence or absence of the test agent whether 
(and, optionally, to what extent) said peptide or polypeptide has 
been phosphorylated. 

20 

7L A method according to Claim 70 wherein the test agent is exposed to 
Protein Kinase D or a functional equivalent thereof prior to the 
Protein Kinase D or a functional equivalent thereof being exposed to 
the said peptide or polypeptide. 

25 

72. A method according to Claim 70 or 71 wherein the polypeptide 
comprising the amino acid sequence (I/L)XRQMSJ, has an amino 
acid sequence with at least 65% identity to the amino acid sequence 
of Figure lb. 
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73. A method of identifying an agent which modulates the interaction 
between Protein Kinase D and the polypeptide whose amino acid 
sequence is shown in Figure: lb or a fragment or variant thereof or 
fusion or a fusion of said fragment or variant, the method comprising 
detecting in the presence or absence of the test agent whether Protein 
Kinase D interacts with the said polypeptide. 

74. An agent identifiable by the method of any of Claims 68 to 72. 

75. An agent identified by the method of any of Claims 68 to 72. 

76. An agent identifiable or identified by the method of any of Claims 68 
to 72 for use in medicine. 

77. A polynucleotide according to any one of Claims 1 to 12 or Claims 
17 to 24 or a polypeptide according to any one of Claims 33 to 43 for 
use in medicine. 



20 78. A polynucleotide according to Claims 1 to 12 or Claims 17 to 24 
further comprising a vector suitable for use in gene therapy. 

79. A pharmaceutical composition comprising an agent identifiable or 
identified by the method of any of Claims 68 to 72 or a 
25 polynucleotide according to any one of Claims 1 to 12 or a 

polypeptide according to any one of Claims 33 to 43 and a 
pharmaceutically acceptable carrier. 
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80. Use of an agent identifiable or identified by the method of any of 
Claims 64 to 67 or a polynucleotide according to any one of Claims 1 
to 12 or Claims 17 to 24 or a polypeptide according to any one of 
Claims 33 to 43 in the manufacture of a medicament for treatment of 
neurodegenerative disease. 

81. Use of an agent identifiable or identified by the method of any of 
Claims 64 to 67 or a polynucleotide according to any one of Claims 1 
to 12 or Claims 17 to 24 or a polypeptide according to any one of 
Claims 33 to 43 for treating neurodegenerative disease. 

82. A method of modulating the function of Protein Kinase D the method 
comprising the step of contacting Protein Kinase D with a peptide or 
polypeptide according to any one of Claims 33 to 39 and 42. 

83. A kit of parts comprising Protein Kinase D or a fimctional equivalent 
thereof and a polypeptide whose amino acid sequence is shown in 
Figure lb as a firagment or variant or fiision thereof or a polypeptide 
according to Claim 42. 

84. A kit of parts according to Claim 83 wherein the Protein Kinase D is 
a constitutively active mutant. 

85. A kit of parts according to Claim 83 or 84 fiarther comprising a 
means for detecting phosphorylation of a protein. 

86. A mutant animal wherein if the organism contains in its genome a 
polynucleotide which encodes a pol>T>^ptide comprising the amino 
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acid sequence given in Figure lb or a fragment or a variant thereof 
the polynucleotide in the nniutant organism is mutated or absent. 

A mutant organism according to Claim 86 which is a mouse. 

A polypeptide comprising any one of the amino acid sequences 
TEDAAEGLPS, CHRQLTVTEF, RQVQKLQAAV, 

DQNNGLAAVP, HPFYNRAAVP and HPFYNRANIN. 

A polypeptide according to Claim 88 which has between 10 and 50 
amino acids. 

A method of diagnosing cancer in a patient comprising the steps of 

(i) determining the level of expression of the Kidins220 gene in a 
sample from the patient containing protein or nucleic acid; 

(ii) comparing the level of expression of step (i) with that in a 
control sample. 

A method of monitoring the progression of cancer in a patient 
comprising the steps of 

(i) determining the level of expression of the Kidins220 gene in a 
sample from the patient containing protein or nucleic acid;; 

(ii) comparing the level of expression of step (i) with that in a 
previous sample from the same patient. 

A method according to Claims 90 or 91 wherein the level of 
expression of said gene is determined using an antibody capable of 
selectively binding to the Kidins220 polypeptide or a polynucleotide 
according to any one of Claims 1 to 13. 
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93. A method according to Claim 92 wherein the antibody is one 
according to any one of Claims 50 to 61. 

94. A method of diagnosing cancer in a patient comprising the steps of 

(i) determining tihe phosphorylation status of the Kidins220 
polypeptide in a sample from the patient containing protein; 

(ii) comparing the phosphorylation status of step (i) with that in a 
control sample. 

95. A method of diagnosing cancer in a patient comprising the steps of 

(i) determining the cellular location of the Kidins220 polypeptide 
in a sample from the patient containing cells; 

(ii) comparing the cellular location of step (i) with that in a 
control sample. 

96. A method of monitoring the progression of cancer in a patient 
comprising the steps of 

(i) determining the phosphorylation status of the Kidins220 
polypeptide in a sample from the patient containing protein; 

(ii) comparing the phosphorylation status of step (i) with that in a 
control sample. 

97. A method of monitoring the progression of cancer in a patient 
comprising the steps of 

(i) determining the cellular location of the Kidins220 polypeptide in 
a sample from the patient containing cells; 

(ii) comparing the cellular location of step (i) with that in a control 
sample. 
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98. A method of treating cancer in a patient comprising administering to 
the patient an agent which inhibits function of the Kidins220 gene or 
its product. 

99. A method according to Claim 98 comprising administering to the 
patient a polynucleotide which is antisense to a polynucleotide 
according to any one of Claims 1 to 10 or 17 to 20 or 22, or an 
antibody according to any one of Claims 50 to 61. 

100. A method according to any one of Claims 90 to 99 wherein the 
cancer is glioblastoma multiforme or prostate cancer. 

101. An agent capable of inhibiting the function of tlie Kidins220 gene or 
its product, an antibody according to any one of Claims 51 to 61 or a 
polynucleotide which is antisense to a polj^ucleotide according to 
any one of Claims 1 to 10 or 17 to 20 or 22, for use in medicine. 

102. Use of an agent capable of inhibiting the function of the Kidins220 
gene or its product in the manufacture of a medicament for treating 
cancer. 

103. Use according to Claim 102 wherein the agent is an antibody 
according to any one of Claims 51 to 61 or a polynucleotide which is 
antisense to a polynucleotide according to any one of Claims 1 to 10 
or 17 to 20 or 22. 

104. Use of an agent which is capable of detecting the expression of the 
Kidins220 gene in the manufacture of a diagnostic reagent for 
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diagnosing or prognosing cancer or for monitoring the progression of 
cancer in a patient. 

105. Use according to Claim 104 wherein the agent is an antibody 
according to any one of Claims 50 to 61 or a polynucleotide 
according to any one of Claims 1 to 13. 

106. Use according to any one of Claims 102 to 105 wherein the cancCT is 
glioblastoma multiforme or prostate cancer. 

107. A compound comprising (i) a moiety which selectively binds the 
Kidins220 polypeptide or a variant thereof and (ii) a further moiety. 



108. A compound according to Claim 107 wherein the moiety which 
15 selectively binds is an antibody. 

109. A compound according to either of Claims 107 or 108 wherein the 
further moiety is a readily detectable moiety. 

20 110. A compound according to any one of Claims 107 or 108 wherein the 
further moiety is a directly or indirectly cytotoxic moiety. 

111. A nucleic acid molecule encoding a compoxmd according to Claim 
107. 



25 



112. A method of imaging cancer in the body of an individual the method 
comprising administering to the individual an effective amount of a 
compound according to Claim 109. 
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113. A method of diagnosing or prognosing cancer in an individual the 
method comprising administering to the individual an effective 
amount of a compoimd according to Claim 109. 

5 114. A method of treating an individual in need of treatment, the method 
comprising administering to the individual an effective amount of a 
compound according to any one of Claims 107, 108 or 1 10. 



115. A pharmaceutical composition comprising a compound according to 
10 any one of Claims 107 to 110 and a pharmaceutically acceptable 

carrier. 

116. A compound according to any one of Claims 107 to 110 for use in 
medicine. 

15 

117. Use of a compound according to Claim 109 in the manufacture of an 
agent for imaging cancer in a body of an individual. 

118. Use of a compound according to Claim 109 in the manufacture of a 
20 diagnostic or prognostic agent for cancer. 



119. Use of a compound according to any one of Claims 107, 108 or 1 10 
in the manufacture of a medicament for treating cancer. 



25 120, A kit of parts comprising a compound according to Claim 107 and 
any one of a directly or indirectly cytotoxic moiety or a readily 
detectable moiety to which the said compound is able to bind via its 
further moiety. 
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5 ' un-transla-ted 

gtcctctgacaaggtctgcggccctgtgagctggctctacagagccgagggcggagtc 
ctcagccaggcc.gagatacggtgaattaaagtcggcaatagatttctgtgaaa 

ORF 

ATGTCAGTTCTTATATCACAGAGTGTGATAAATTATGTGGAAGAAGAGAATATTCCTG 

CTCTGAAAGCTCTTCTTGAGAAGTGCAAAGATGTAGATGAGAGGAACGAGTGTGGCCA 

GACGCCGCTGATGCTGGCTGCGGAGCAGGGCAATGTGGAAATAGTGAAAGAACTGTTA 

AAGAATGGAGCCAACTGCAATCTGGAAGATCTGGATAACTGGACTGCCCTTATATCGG 

CATCTAAGGAGGGGCACATCCACATTGTGGAGGAGCTGCTGAAGAGCGGGGCCAGTCT 

GGAGCACCGAGACATGGGTGGATGGACAGCTCTCATGTGGGCGTGCTACAAAGGCAGG 

ACCGACGTGGTCGAGCTGCTTCTTTCTCATGGTGCCAACCCAAGTGTCACTGGTTTGC 

AGTATAGTGTCTACCCAATCATCTGGGCAGCGGGCAGAGGCCATGeCGACATAGTGCA 

TCTTCTACTGCAGAATGGTGCGAAAGTCAATTGCTCCGACAAGTACGGAACAACCCCC 

TTGGTATGGGCTGCCCGAAAGGGTCATTTGGAATGTGTGAAGCACTTGTTGGCCATGG 

GAGCTGATGTTGATCAAGAAGGAGCCAATTCGACGACTGCACTTATCGTGGCAGTGAA 

AGGAGGCTACACACAGTCCGTGAAAGAGATTTTGAAAAGGAATCCAAATGTGAACCTA 

ACTGACAAGGATGGAAACACAGCTCTGATGATCGCATCGAAGGAGGGACATATTGAGA 

TTGTGCAGGACCTGCTGGACGCCGGAACATACGTGAACATACCTGACAGGAGTGGGGA 

TACTGTATTGATCGGCGCTGTGAGAGGTGGTCATGTTGAAATTGTTCGAGCACTTCTC 

CAAAAATATGCTGATATAGACATTAGAGGGCAGGACAATAAGACCGCGCTGTATTGGG 

CCGTTGAAAAGGGAAATGCGACAATGGTGAGGGATATCCTGCAGTGCAATCCGGACAC 

GGAAATATGCACGAAGGATGGTGAGACACCACTGATAAAAGCCACCAAGATGAGAAAT 

ACTGAAGTGGTGGAGCTGCTGCTGGATAAGGGGGCTAAGGTGTCTGCTGTGGACAAGA 

AAGGGGACACTCCCCTGCACGTTGCCATCCGAGGGAGGAGCCGGAGACTGGCCGAACT 

TCTTTTACGGAACCCCAAAGATGGACGGTTACTCTACAGACCCAACAAAGCAGGCGAG 

ACTCCCTACAACATTGACTGCAGCCACCAGAAAAGCATTTTAACTCAAATATTTGGAG 

CCAGACACTTGTCTCCTACGGAAACAGATGGCGACATGCTTGGTTATGATTTGTATAG 

CAGTGCCCTGGCCGATATTCTCAGTGAACCCACCATGCAACCACCCATTTGTGTGGGC 

TTGTATGCACAGTGGGGAAGTGGGAAGTCTTTCTTACTCAAGAAACTAGAAGATGAGA 

TGAAGACGTTTGCAGGACAGCAGACCGAGCCCCTCTTCCAGTTCTCGTGGCTCATAGT 

GTTCCTCACCCTGCTGCTGTGTGGCGGGCTCGGCCTAGTGTTTGCTTTCACGGTTGAT 

ACAAACCTCGCCATAGCCATTTCATTGAGTTTCCTGGCTCTCATATATATATTCTTCA 

TCGTCATTTACTTCGGTGGACGGCGGGAAGGAGAGAGTTGGAACTGGGCCTGGGCTCT 

CAGTACCAGACTGGCAAGACACATTGGGTACCTGGAGCTCCTCTTCAAACTGATGTTT 

GTGAACCCACCTGAACTTGCGGAGCAAACTACAAAGGCTTTACCTGTGAGGTTTTTGT 

TTACAGATTACAATAGACTCTCCAGTGTGGGAGGAGAAACTTCCTTGGCTGAAATGAT 

CGCGACCCTCTCAGATGCTTGTGAGAGAGAATTCGGCTTTTTGGCTACCCGTCTTTTT 

CGAGTCCTCAGGACTGAAGAGTCTCAGGGTAAAAAGAAATGGAAAAGAACATGCTGCC 

TCCCCTCTTTTGTCATCTTTCTTTTTATTGTTGGCTGCATTATTGCTGGGATCACCCT 

TCTGGCTATATTCAGAGTTGACCCGAAACATCTGACAGTGAATGCTATCCTCATTTCT 

ATCGCATCTGTCGTGGGGTTGGCCTTCGTGCTCAATTGTCGAACATGGTGGCAAGTGC 

TGGACTCTCTCCTGAATTCTCAGAGAAAACGCCTCCATAGTGCTGCCTCCAAATTACA 

CAAGTTAAAAAGTGAAGGATTCATGAAAGTCCTGAAGTGTGAGGTGGAGCTGATGGCC 

AGGATGGCAAAAACCATTGACAGCTTCACTCAAAACCAGACAAGGCTGGTGGTTATCA 
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TTGACGGCCTGGATGCCTGCGAGCAGGATAAAGTCCTTCAGATGCTGGACACTGTCCG 
AGTTCTGTTTTCAAAGGGCCCTTTTATTGCCATTTTTGCAAGTGATCCGCATATTATC 
ATAAAGGCCATCAACCAGAACCTGAATAGTGTGCTTCGTGACTCAAATATAAACGGAC 
ATGACTACAT.GCGCAATATAGTTCACTTACCAGTTTTTCTTAATAGTCGTGGGCTTAG 
CAATGCAAGGAAATTCCTTGTAACTTCAGCAACAAATGGGGACATTACATGCTCAGAC 
ACCACAGGAAGACAGGAGGACACTGACAGAAGAGTTTCACAGAACAGCCTTGGGGAGA 
TGACAAAGC-TTGGGAGCAAAACGGCTCTCAACAGAAGGGACACTTACCGCAGAAGACA 
GATGCAGA^AAGGATCACGAGGCAGATGTCCTTTGATCTCACAAAGCTGCTGGTCACT 
GAGGATTGGTTCAG^TGACATCAGCCCTCAGACCATGAGACGGCTACTCAATATTGTTT 
. CTGTGACAGGGCGGCTACTAAGAGCCAATCAGATT ACTTTCAACT GGGACAGGCT AGO 

cagctggatcaacctcaccgagcagtggccgtaccggacgtcttggcttatactatat 
ctggaagagactgaaggtctccctgatcagatgaccttaaagaccatctatgagagaa 
tatcgaagaatatcccaacaactaaagatgttgagccgcttcttgaaattgatggaga 
tataagaaactttgaagtcttcttgtcttcaaggactccagttctcgtggctcgagat 
gtaaagacctttttgccgtgcacagtaaacctggatcccaaacttcgggagaccatcg 
cagatgtccgagctgcaagagagcagatcaacataggaggcctggcctatcccccgct 
ccctctgcacgaaggccctcctcggccaccttctgggtacagtcagcctgcatctgtc 
tgctcttcagcatctttcaacgggccgttcccaggcggggttgtgtctccacagcccc 
acagcagttactacagcggcttgtccggaccccagcaccccttctacaacaggccatt 
ctttgccccatacctttacacgccaaggtattaccctggcggttcccaacatctcatc 
tcacgttcatcagtaaaagcgagtttgcccagagatcagaacaatggcctaccgtgtg 
actctgggtttaacaaacagagacagggcagccgtccctgccacaggcagctcactgt 
tact gagtt caat gaccgt ggat gt cgttgt gagaaact gagacagatagaagggctg 
gaccagagcatgatgcctcagtactgcacaacaatcaaaaaggcgaacataaatggcc 
gggt att gt ct c agt gt aac att gat gagct gaagaaagagat ggct atgaact t t gg 
agactggcatct cttcagaagcatggtcttagaaat gagaagtgtggaaagccaggtg 
gtccctgaagaccctcgttttctcaatgaaaactccagtgccccagttcctcatggag 
agtctgctcgccgcccttctcacactgagctgcccctcacggagctttccagccagac 
tccatacaccctgaacttcagctttgaagaactgaacacgcttggccttgatgagggt 
gcccctcgtcacagtaatctaagttggcagtcacaaactcgcagaaccccaagtctct 
cgagtctcaattcgcaggactccagtattgaaatttcaaagcttactgataaggtgca 
ggccgagtatagagacgcctatagagagtacattgctcagatgtcccagttagaaggg 
ggcacagggtcgtccacaataagcggcagatcttctccacatagcacatactacatag 
gccagagctcttcagggggctccatccattctactctagaacaggaaagggggaagga 
gggtgagctaaagcaggaggatgggcggaagtcatttttaatgaagaggggggatgtc 
atagattactcctcatcaggggtgtccactaatgaggcctcacccctggaccccatta 
ctgaagaagatgagaaatccgaccagtcaggtagtaagctactcccaggcaagaaatc 
ctcagaacggcccagtctcttccagacagacttgaagcttaaggggggtggtcttcgc 
taccagaaactgcccagtgatgaagatgagtcggggacagaagagtcagataacaccc 
cactgctcaaagatgataaggacaaaaaggctgaaggcaaagcagagagagtgtgcaa 
gtccccagagcacagtgccgagcccatcag2\acgttcattaaagcaaaagagtatttg 
tccgatgccctccttgacaaaaaggattcctcagattcgggagtgaggtccaatgaga 
gttctcccaaccactctctccacaatgaggcagcagacgactcccaactggaaaaggc 
aaacctcatagagcttgaagatgagggccacagtgggaagcgggggatgccacacagt 
ctgagtggcttgcaagatccaattatagctcggatgtccatttgctcggaagacaaga 
aaagcccttccgagtgcagcctgattgctagcagccctgaagaaagctggcctgcgtg 
ccagaaggc ctacaatct aaaccgaacgcccagcact gt gact ct gaacaacaacact 
gcacccaccaacagagccaatcagaacttcgatgagatagagggaatcagggagacgt 
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CCCAGGTCATTCTGAGGCCTGGCCCCAGTCCCAACCCAACTGCTGTGCAGAATGAAAA 
CCTGAAGAGCATGGCGCACAAGCGAAGCCAGCGCTCCAGTTACACAAGGCTCTCGAAA 
GATGCATCTGAGCTACATGCAGCCTCCTCGGAGAGCACAGGCTTTGGAGAAGAGAGAG 
AGAGCATTCTC 

Stop codon and 3 "^untranslated 

tgaggaaggagaagtgaaaagagtgtgacagactttgagttgaccattcctcattactcatca 
tgtttatgcatcacagacacctgataggtggtcaggcactggaagcagacaggaaagcctgaa 
agcacagtcaccagtcattttcattgtcaccgtcatcccaggcactttaaagtctcactgcca 
ctattacacctttcctggcacacagctaggcagagcgatgagagcaggctgaagacaaaatag 
gtcaaaagtgtatttaaagttgaaagggcttaaatcacaaggaaaaaaacttagagatctcct 
ttataaactattcatgtaccatttgagggaagatacagtgttagatgaagcagaaaccaattt 
tattgttagtgtttaatcttggtgcagatttataagttttagagtagcccagagactaaaagt 
gaatacttagcaaatggatagccagtgttctatataggaatcattgcttttcagagggcttaa 
agtttaagtagaaaatatatactcaagaaggcgataaaagctgatgagaaagtgagttagcag 
aacccaaagccgtgctgggccgcggtgactcattagcagaggaggaggggagggcagtatatt 
cctgggatactctctccagacccagcctggcttctgacatcatccacctgtgccctcaaaacc 
gtcttagtctgttctgcaactcttaagtgacatacctaactcagctcatggctaaggaaaaaa 
aattaaagttgtcctggtgattaaactctggacctcccacatctaagtcccgagttgacaaac 
tgcatccccagcccaaacatctgttcttccagagatctggaaagacatgggca'taagagtcca 
gtaaactcttaatgaaggtgtcgggaagtgtttcccttaaaggggttagtgacagtgacacta 
gacaagtgacagaatttagtttgctgacattaaatcactgccttctgggccgcaatagtttag 
tatctgttcctaaatcagcatttgagttctgcgggggttacaaaatggaggcatggttttaag 
aaggaatttgaaagcatccagggaagcaagaacagagtcggagctggtgtgaccatgacactt 
cccattattgttagcaaagctaggagaatgaccacatacatttaagtatcatgtaggctagag 
cgtgcctgtcatcatcatgcattgagaccgtttgtcctttgattatagtgtgtttatgtagac 
cagtcctagcagtatgtcctgagccatgggcaggattgtagatggcgaggtggcaggtttagt 
gctcgttgagatgtccatctctgactggacctgtgtggtttacttgtagctgctccgtgaaac 
actgccttgctgtttgagtgtaattgagcattttcatgaagtgtatatgttgcatgatcttta 
acaggtatttttagacattaagaacagcatgatattactgacatatgcattagtagtatgcag 
tatcatgtaatgggtacttcagaggcagtttggtttctttccaggtgagtagttatagctcac 
agtcatgggaatcctttattcacactgtctgtgaagtcgatgctggagtgccaaatgcacacg 
gagactaggaacaataaagacttgcatgccattctgactcatggagttttattcaatagagca 
atgataattcccattctgattaaaaattaaaacacggaa 
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Figure 11 (page 1 of 2) 

Htaman__Kidins Protein Sequence 

Length: 1715 June 29, 2000 14:41 Type: P 

1 MSVLISQSVI NYVEEENIPA LKALLEKGKD VDERNECGQT PLMIAAEQGN 

51 LEIVKELIKN GANCNLEDLD NWTALISASK EGHVHIVEEL LKCGVNLEHR 

101 DMGGWTALMW ACYKGRTDW ELLLSHGANP SVTGLQYSVY PIIWAAGRGH 

151 ADIVHLLLQN GAKVNCSDKY GTTPLVWAAR KGHLECVKHL LAMGADVDQE 

201 GANSMTALIV AVKGGYTQSV KEILKRNPNV NLTDKDGNTA LMIASKEGHT 

251 EIVQDLLDAG TYVNIPDRSG DTVLIGAVRG GHVEIVRALL QKYADIDIRG 

301 QDNKTALYWA VEKGNATMVR DILQCNPDTE ICTKDGETPL IKATKMRNIE 

351 WELLLDKGA KVSAVDKKGD TPLHIAIRGR SRKLAELLLR NPKDGRLLYR 

401 PNKAGETPYN IDCSHQKSIL TQIFGARHLS PTETDGDMLG YDLYSSALAD 

451 ILSEPTMQPP IGVGLYAQWG SGKSFLLKKL EDEMKTFAGQ QIEPLFQFSW 

501 LIVFLTLLLC GGLGLLFAFT VHPNLGIAVS LSFLALLYIF FIVIYFGGRR 

551 EGESWNWAWV LSTRLARHIG YLELLLKLMF VNPPELPEQT TKALPVRFLF 

601 TDYNRLSSVG GETSLAEMIA TLSDACEREF GFLATRLFRV FKTEDTQGKK 

651 KWKKTCCLPS FVIFLFIIGC IISGITLLAI FRVDPKHLTV NAVLISIASV 

701 VGLAFVLNCR TWWQVLDSLL NSQRKRLHNA ASKLHKLKSE GFMKVLKCEV 

751 ELMARMAKTI DSFTQNQTRL WIIDGLDAC EQDKVLQMLD TVRVLFSKGP 

801 FIAIFASDPH IIIKAINQNL NSVLRDSNIN GHDYMRNIVH LPVFLNSRGL 

851 SNARKFLVTS ATNGDVPCSD TTGIQEDADR RVSQNSLGEM TKLGSKTALN 

901 RRDTYRRRQM QRTITRQMSF DLTKLLVTED WFSDISPQTM RRLLNIVSVT 

951 GRLLRANQIS FNWDRLASWI NLTEQWPYRT SWLILYLEET EGIPDQMTLK 

1001 TIYERISKNI PTTKDVEPLL EIDGDIRNFE VFLSSRTPVL VARDVKVFLP 

1051 CTVNLDPKLR EIIADVRAAR EQISIGGLAY PPLPLHEGPP RAPSGYSQPP 
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Figure 



11 (page 2 of 2) 



1101 SVCSSTSFNG PFAGGVVSPQ PHSSYYSGMT GPQHPFYNRG SGPAPGPWL 

1151 LNSLNVDAVC EKLKQIEGLD QSMLPQYCTT IKKANINGRV LAQCNIDELK 

1201 KEMNMNFGDW HLFRSTVLEM RNAESHVVPE DPRFLSESSS GPAPHGEPAR 

1251 RASHNELPHT ELSSQTPYTL NFSFEELNTL GLDEGAPRHS NLSWQSQTRR 

1301 TPSLSSLNSQ DSSIEISKLT DKVQAEYRDA YREYIAQMSQ LEGGPGSTT.I 

1351 SGRSSPHSTY YMGQSSSGGS IHSNLEQEKG KDSEPKPDDG RKSFLMKRGD 

1401 VIDYSSSGVS TNDASPLDPI TEEDEKSDQS GSKLLPGKKS SERSSLFQTD 

1451 LKLKGSGLRY QKLPSDEDES GTEESDNTPL LKDDKDRKAE GKVERVPKSP 

1501 EHSAEPIRTF IKAKEYLSDA LLDKKDSSDS GVRSSESSPN HSLHNEVADD 

1551 SQLEKANLIE LEDDSHSGKR GIPHSLSGLQ DPIIARMSIC SEDKKSPSEC 

1601 SLIASSPEEN WPACQKAYNL NRTPSTVTLN NNSAPANR2\N QNFDEMEGIR 

1651 ETSQVILRPS SSPNPTTIQN ENLKSMTHKR SQRSSYTRLS KDPPELHAAA 

17 01 SSESTGFGEE RESIL 
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Htunan Kidins DNA Sequence 

Length: 7095 July 4, 2000 16:54 Type: N 



1 


ggaagaatat 


ggccgccggg 


tgtggtgagg 


gcgacgcgct 


tgcagtcgcc 


51 


gtctcttgct 


tccccgtcct 


ctgacatcgc 


ctgcagccga 


gcgggcccgt 


101 


tccgccggag 


ctgaggacca 


ggtattcaaa 


taaagttaat 


tgcagctttc 


151 


tgtgaaaATG 


TCAGTTTTGA 


TATCACAGAG 


CGTCATAAAT 


TATGTAGAGG 


201 


AAGAAAACAT 


TCCTGCTCTG 


AAAGCTCTTC 


TTGAAAAATG 


CAAAGATGTA 


251 


GATGAGAGAA 


ATGAGTGTGG 


CCAGACTCCA 


CTGATGATAG 


CTGCCGAACA 


301 


AGGCAATCTG 


GAAATAGTGA 


AGGAATTAAT 


TAAGAATGGA 


GCTAACTGCA 


351 


ATCTGGAAGA 


TTTGGATAAT 


TGGACAGCAC 


TTATATCTGC 


ATCGAAAGAA 


401 


GGGCATGTGC 


ACATCGTAGA 


GGAACTACTG 


AAATGTGGGG 


TTAACTTGGA 


451 


GCACCGTGAT 


ATGGGAGGAT 


GGACAGCTCT 


TATGTGGGCA 


TGTTACAAAG 


501 


GCCGTACTGA 


CGTAGTAGAG 


TTGCTTCTTT 


CTCATGGTGC 


CAATCCAAGT 


551 


GTCACTGGTC 


TGCAGTACAG 


TGTTTACCCA 


ATCATTTGGG 


cagcagggAg 


601 


AGGCCATGCA 


GATATAGTTC 


ATCTTTTACT 


GCAAAATGGT 


GCTAAAGTCA 


651 


ACTGCTCTGA 


TAAGTATGGA 


ACCACCCCTT 


TAGTTTGGGC 


TGCACGAAAG 


701 


GGTCATTTGG 


AATGTGTGAA 


ACATTTATTG 


GCCATGGGAG 


CTGATGTGGA 


751 


TCAAGAAGGA 


GCTAATTCAA 


TGACTGCACT 


TATTGTGGCA 


GTGAAAGGAG 


801 


GTTACACACA 


GTCAGTAAAA 


GAAATTTTGA 


AGAGGAATCC 


AAATGTAAAC 


851 


TTAACAGATA 


AAGATGGAAA 


TACAGCTTTG 


ATGATTGCAT 


CAAAGGAGGG 


901 


ACATACGGAG 


ATTGTGCAGG 


ATCTGCTCGA 


CGCTGGAACA 


TATGTGAACA 


951 


TACCTGACAG 


GAGTGGGGAT 


ACTGTGTTGA 


TTGGCGCTGT 


CAGAGGTGGT 


1001 


CATGTTGAAA 


TTGTTCGAGC 


GCTTCTCCAA 


AAATATGCTG 


ATATAGACAT 


1051 


TAGAGGACAG 


GATAATAAAA 


CTGCTTTGTA 


TTGGGCTGTT 


GAGAAAGGAA 


1101 


ATGCAACAAT 


GGTGAGAGAT 


ATCTTACAGT 


GCAATCCTGA 


CACTGAAATA 
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1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 



TGCACAAAGG ATGGTGAAAC GCCACTTATA AAGGCTACCA AGATGAGAAA 
CATTGAAGTG GTGGAGCTGC TGCTAGATAA AGGTGCTAAA GTGTCTGCTG 
TAGATAAGAA AGGAGATACT CCCTTGCATA TTGCTATTCG TGGAAGGAGC 
CGGAAACTGG CAGAACTGCT TTTAAGAAAT CCCAAAGATG GGCGATTACT 
TTATAGGCCC AACAAAGCAG GCGAGACTCC TTATAATATT GACTGTAGCC 
ATCAGAAGAG TATTTTAACT CAAATATTTG GAGCCAGACA CTTGTCTCCT 
ACTGAAACAG ACGGTGACAT GCTTGGATAT GATTTATATA GCAGTGCCCT 
GGCAGATATT CTCAGTGAGC CTACCATGCA GCCACCCATT TGTGTGGGGT 
TATATGCACA GTGGGGAAGT GGGAAATCTT TCTTACTCAA 'GAAACTAGAA 
GACGAAATGA AAACCTTCGC CGGACAACAG ATTGAGCCTC TCTTTCAGTT 
CTCATGGCTC ATAGTGTTTC TTACCCTGCT ACTTTGTGGA GGGCTTGGTT 
TATTGTTTGC CTTCACGGTC CACCCAAATC TTGGAATAGC AGTGTCACTG 
AGCTTCTTGG CTCTCTTATA TATATTCTTT ATTGTCATTT ACTTTGGTGG 
ACGAAGAGAA GGAGAGAGTT GGAATTGGGC CTGGGTCCTC AGCACTAGAT 
TGGCAAGACA TATTGGATAT TTGGAACTCC TCCTTAAATT GATGTTTGTG 
AATCCACCTG AGTTGCCAGA GCAGACTACT AAAGCTTTAC CTGTGAGGTT 
TTTGTTTACA GATTACAATA GACTGTCCAG TGTAGGTGGA GAAACTTCTC 
TGGCTGAAAT GATTGCAACC CTCTCGGATG CTTGTGAAAG AGAGTTTGGC 
TTTTTGGCAA CCAGGCTTTT TCGAGTATTC AAGACTGAAG ATACTCAGGG 
TAAAAAGAAA TGGAAA2\AAA CATGTTGTCT CCCATCTTTT GTCATCTTCC 
TTTTTATCAT TGGCTGCATT ATATCTGGAA TTACTCTTCT GGCTATATTT 
•AGAGTTGACC CAAAGCATCT GACTGTAAAT GCTGTCCTCA TATCAATCGC 
ATCTGTAGTG GGATTGGCCT TTGTGTTGAA CTGTCGTACA TGGTGGCAAG 
TGCTGGACTC GCTCCTGAAT TCCCAAAGAA AACGCCTCCA TAATGCAGCC 
TCCAAACTGC ACAAATTGAA AAGTGAAGGA TTCATGAAAG TTCTTAAATG 
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2401 TGAAGTGGAA TTGATGGCCA GGATGGCAAA AACCATTGAC AGCTTCACTC 

2451 AGAATCAGAC AAGGCTGGTG GTCATCATCG ATGGATTAGA TGCCTGTGAG 

2501 CAGGACAAAG TCCTTCAGAT GCTGGACACT GTCCGAGTTC TGTTTTCAAA 

2551 AGGCCCGTTC ATTGCCATTT TTGCAAGTGA TCCACATATT ATCATAAAGG 

2601 CAATTAACCA GAACCTCAAT AGTGTGCTTC GGGATTCAAA TATAAATGGC 

2651 CATGACTACA TGCGCAACAT AGTCCACTTG CCTGTGTTCC TTAATAGTCG 

2701 TGGACTAAGC AATGCAAGAA AATTTCTCGT AACTTCAGCA ACAAATGGAG 

2751 ACGTTCCATG CTCAGATACT ACAGGGATAC AGGAAGATGC TGACAGAAGA 

2801 GTTTCACAGA ACAGCCTTGG GGAGATGACA AAACTTGGTA GCAAGACAGC 

2851 CCTCAATAGA CGGGACACTT ACCGAAGAAG GCAGATGCAG AGGACCATCA 

2901 CTCGCCAGAT GTCCTTTGAT CTTACAAAAC TGCTGGTTAC CGAGGACTGG 

2951 TTCAGTGACA TCAGTCCCCA GACCATGAGA AGATTACTTA ATATTGTTTC 

3001 TGTGACAGGA CGATTACTGA GAGCCAATCA GATTAGTTTC AACTGGGACA 

3051 GGCTTGCTAG CTGGATCAAC CTTACTGAGC AGTGGCCATA CCGGACTTCA 

3101 TGGCTCATAT TATATTTGGA AGAGACTGAA GGTATTCCAG ATCAAATGAC 

3151 ATTAAAAACC ATCTACGAAA GAATATCAAA GAATATTCCA ACAACTAAGG 

3201 ATGTTGAGCC ACTTCTTGAA ATTGATGGAG ATATAAGAAA TTTTGAAGTG 

3251 TTTTTGTCTT CAAGGACCCC AGTTCTTGTG GCTCGAGATG TAAAAGTCTT 

3301 TTTGCCATGC ACTGTAAACC TAGATCCCAA ACTACGGGAA ATTATTGCAG 

3351 ATGTTCGTGC TGCCAGAGAG CAGATCAGTA TTGGAGGACT GGCGTACCCC 

3401 CCGCTCCCTC TACATGAGGG TCCTCCTAGG GCGCCATCAG GGTACAGCCA 

3451 GCCCCCATCC GTGTGCTCTT CCACGTCCTT CAATGGGCCC TTCGCAGGTG 

3501 GAGTGGTGTC ACCACAGCCT CACAGCAGCT ATTACAGCGG CATGACGGGC 

3551 CCTCAGCATC CCTTCTACAA CAGGGGGTCA GGCCCAGCCC CAGGCCCAGT 

3601 GGTATTACTG AATTCACTGA ATGTGGATGC AGTATGTGAG AAGCTGAAAC 
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3651 

3701 

3751 

3801 

3851 

3901 

3951 

4001 

4051 

4101 

4151 

4201 

4251 

4301 

4351 

4401 

4451 

4501 

4551 

4601 

4651 

4701 

4751 

4801 

4851 



AAATAGAAGG GCTGGACCAG AGTATGCTGC CTCAGTATTG TACCACGATC 
AAAAAGGCAA ACATAAATGG CCGTGTGTTA GCTCAGTGTA ACATTGATGA 
GCTGAAGAAA GAGATGAATA TGAATTTTGG AGACTGGCAC CTTTTCAGAA 
GCACAGTACT AGAAATGAGA AACGCAGAAA GCCACGTGGT CCCTGAAGAC 
CCACGTTTCC TCAGTGAGAG CAGCAGTGGC CCAGCCCCGC ACGGTGAGCC 
TGCTCGCCGC GCTTCCCACA ACGAGCTGCC TCACACCGAG CTCTCCAGCC 
AGACGCCCTA CACACTCAAC TTCAGCTTCG AAGAGCTGAA CACGCTTGGC 
CTGGATGAAG GTGCCCCTCG TCACAGTAAT CTAAGTTGGC AGTCACAAAC 
TCGCAGAACC CCAAGTCTTT CGAGTCTCAA TTCCCAGGAT TCCAGTATTG 
AAATTTCAAA GCTTACTGAT AAGGTGCAGG CCGAGTATAG AGATGCCTAT 
AGAGAATACA TTGCTCAGAT GTCCCAGTTA GAAGGGGGCC CCGGGTCTAC 
AACCATTAGT GGCAGATCTT CTCCACATAG CACATATTAC ATGGGTCAGA 
GTTCATCAGG GGGCTCTATT CATTCAAACC TAGAGCAAGA AAAGGGGAAG 
GATAGTGAAC CAAAGCCCGA TGATGGGAGG AAGTCCTTTC TAATGAAGAG 
GGGAGATGTT ATCGATTATT CATCATCAGG GGTTTCCACC AACGATGCTT 
CCCCCCTGGA TCCTATCACT GAAGAAGATG A2\A2\ATCAGA TCAGTCAGGC 
AGTAAGCTTC TCCCAGGCAA GAAATCTTCC GAAAGGTCAA GCCTCTTCCA 
GACAGATTTG AAGCTTAAGG GAAGTGGGCT GCGCTATCAA AAACTCCC2\A 
GTGACGAGGA TGAATCTGGC ACAGAAGAAT CAGATAACAC TCCACTGCTC 
AAAGATGACA AAGACAGAAA AGCCGAAGGG AAAGTAGAGA GAGTGCCGAA 
GTCTCCAGAA CACAGTGCTG AGCCGATCAG AACCTTCATT AAAGCCAAAG 
AGTATTTATC GGATGCGCTC CTTGACAAAA AGGATTCATC GGATTCAGGA 
GTGAGATCCA GTGAAAGTTC TCCCAATCAC TCTCTGCACA ATGAAGTGGC 
GGATGACTCC CAGCTTGAAA AGGCAAATCT CATAGAGCTG GAAGATGACA 
GTCACAGCGG AAAGCGGGGA ATCCCACATA GCCTGAGTGG CCTGCAAGAT 
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4901 CCAATTATAG CTCGGATGTC CATTTGTTCA GAAGACAAGA AAAGCCCTTC 
4951 CGAATGCAGC TTGATAGCCA GCAGCCCTGA AGAAAACTGG CCTGCATGCC 
5001 AGAAAGCCTA CAACCTGAAC CGAACTCCCA GCACCGTGAC TCTGAACAAC 
5051 AATAGTGCTC CAGCCAACAG AGCCAATCAA AATTTCGATG AGATGGAGGG 
5101 AATTAGGGAG ACTTCTCAAG TCATTTTGAG GCCTAGTTCC AGTCCCAACC 
5151 CAACCACTAT TCAGAATGAG AATCTAAAAA GCATGACACA TAAGCGAAGC 
5201 CAACGTTCAA GTTACACAAG GCTCTCCAAA GATCCTCCGG AGCTCCATGC 
5251 AGCAGCCTCT TCTGAGAGCA CAGGCTTTGG AGAAGAAAGA GAAAGCATTC 
5301 TTtgagaaaa acaagcaaag gagaagagtg ttactgtacc cttatgacag 
5351 aattgtcctg gattttgact ccatccacgc ccatcacctt tctacatttt 
5401 gctgacagat aactaaccga tgatgaggcc gaggtaaaag agacatctgc 
5451 agtgtgacag aagggagcat gagaagcatg gctcaccagc cagcctctgt 
5501 ggtctttgta attagaagct tcagaactca ctaatactac tgtacctttc 
5551 attggcgcat taccccataa aactttttga gacgaggtga gatctgagta 
5601 taaagatagg tcagaagtat tttaaagggc ttaatgtgcc aaaaagaaaa 
5651 aaagctagag accctttttg caaacatttg gtgaccacac atttgaggga 
5701 agacgtggcg ttaggtgaag cagaagcaaa ccctgctctt aggggctcac 
5751 ctaggtgagt gcacagcctg tgacgctaca gggagaggct gagtaaaccg 
5801 agatccagcg ttctgtatgg caggggtatt gcttatcaca gaggttctga 
5851 agagtaggaa gtacataatg aagagggctt taaaaattgc caacaaagtg 
5901 agtcaccagg gctggcagta gtgtgacggg gctgtcctga gctgttagga 
5951 gagtagatgc ggggagggct ggtgacctcc gtgggtttat atgtcggaaa 
6001 ctcttctctc caaatcccag gcctggcttc cagcaccatc cagctgtgcc 
6051 caagaagcca ccctggtctg ttctccaact cttttaaatg gtgcccaact 
6101 tttctaagtg agcttagcaa tgagaagaaa aaaaaacatg aattcttttt 
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6151 


ctggaaaatc 


agggagacat 


gggtaataat 


aggtactaat 


aaatatttat 


6201 


agatgagtga 


atgaggaaat 


aattacatca 


aaaaggtcag 


tgacaattga 


6251 


taaatgacaa 


ggaaatattt 


aattaggtaa 


aactaaatca 


ttgctctcta 


6301 


tactaggata 


gactttatct 


acttcatctg 


ttcctaagtc 


agcatgttag 


6351 


ttctggggaa 


ggatcataag 


aaaggaaata 


ctttttaaaa 


aaaaatttgg 


6401 


aaacatgtaa 


caaagcaagg 


gtaaaatata 


tatatatatc 


tatataagtg 


6451 


ctgtgactgt 


aaaagtgtac 


tttccattaa 


ttattagccg 


agttaagaga 


6501 


atggtcacat 


tgaagtactg 


tgtggactag 


aaatgtaccc 


tgtcatcatg 


6551 


caatgaaata 


ttgttatcgt 


tttaacatag 


ctcatttatg 


tagaatgaat 


6601 


tctggtggtt 


taccccaagt 


cacagttagg 


acggtagatg 


gtgagatcgc 


6651 


agatgcgcta 


ttatctagat 


tcagtgttac 


attttcgatg 


tttatcactc 


6701 


agtgggtttt 


tattaatatg 


ctgattaagt 


tatttactgg 


gccagtcatt 


6751 


gtgctaaata 


gttgctcttt 


tgtgtttcat 


tgccttgatg 


tttgagtgta 


6801 


atctagcatt 


ttaatacagt 


gtttattttg 


catgatcttt 


aacaaatgtt 


bo o 1 


ttaagcaatt 


ttaaaaaggc 


aggatgttat 


tgacattata 


cactgaagtc 


6901 


ttaacatttt 


aacatttata 


gtgcttattt 


gcaaaatHgt 


ataattagga 


6951 


attatttcag 


agacaatgtt 


ttctttttca 


ggtgagtagt 


tgccgcgtaa 


7001 


tatcattgga 


gtacattctt 


tatactgttt 


gtgaaattaa 


tactagcata 


7051 


ttaagtgtac 


aaatagattt 


agaaaacaat 


aaaaaattgc 


atgct 
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Figure 15 (page 1 of 13) 

BESTFIT of: HumanJKidins check: 8556 from: 1 to: 7095 

to: RatJCldins check: 131 from: 1 to: 7140 

Gap Weight: 50 Average Match: 10.000 

Length Weight: 3 Average Mismatch: -9.000 

Quality: 39460 Length: 7318 

Ratio: 5.614 ' Gaps: 59 

Percent Similarity: 81.558 Percent Identity: 81.558 

Match display thresholds for the alignment (s) : 
I = IDENTITY 
: = 5 
. = 1 

Human Kidins x Rat Kidins 



66 gtcctctgacatcgcctgcagccgagcgggcccgttccgccggagctgag 115 
I I I I I I I I i I I I I I I t M I I i I I III I I II I I II 
1 gtcctctgacaaggtctgcggccctgtgagctggct . ctacagagccgag 4 9 
• • . . • 

116 gaccaggtattc aaataaagttaattgcag 145 

I i I I I I I I I I I I I I II 

50 ggcggagtcctcagccaggccgagatacggtgaattaaagtcggcaatag 99 

146 ctttctgtgaaaATGTCAGTTTTGATATCACAGAGCGTCATAAATTATGT 195 

I I 11 I I I I I I i i I I II I I I i I I I I 11 I I I I II II II I I I I I I II I 
100 atttctgtgaaaATGTCAGTTCTTATATCACAGAGTGTGATAAATTATGT 149 

196 AGAGGAAGAAAACATTCCTGCTCTGAAAGCTCTTCTTGAAAAATGCAAAG 245 

li III il II I I II i i I II I II I II II I II I i II I I II I M I I I I 
150 GGAAGAAGAGAATATTCCTGCTCTGAAAGCTCTTCTTGAGAAGTGCAAAG 199 

246 ATGTAGATGAGAGAAATGAGTGTGGCCAGACTCCACTGATGATAGCTGCC 295 

I I II I I I I I II II II I I I I I I I I I II I i I II MINI I I I I I I 

200 ATGTAGATGAGAGGAACGAGTGTGGCCAGACGCCGCTGATGCTGGCTGCG 249 

296 GAACAAGGCAATCTGGAAATAGTGAAGGAATTAATTAAGAATGGAGCTAA 345 

II II I 11 1 I I I I II I I I II I I I I III I I I I I I I M I I I i II 

250 GAGCAGGGCAATGTGGAAATAGTGAAAGAACTGTTAAAGAATGGAGCCAA 299 

346 CTGCAATCTGGAAGATTTGGATAATTGGACAGCACTTATATCTGCATCGA 395 

I II I II I I I I 1 I II I I I I I I I II I I I II II I I I I I I I I I I I I I I 
300 CTGCAATCTGGAAGATCTGGATAACTGGACTGCCCTTATATCGGCATCTA 34 9 
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Figure 15 (page 2 of 13) 

396 AAGAAGGGCATGTGCACATCGTAGAGGAACTACTGAAATGTGGGGTTAAC 445 

I I I I I I i I I I I I I I i I I I I I I I I I I I I I I I I I I I 

350 AGGAGGGGCACATCCACATTGTGGAGGAGCTGCTGAAGAGCGGGGCCAGT 399 

• • - . , 

44 6 TTGGAGCACCGTGATATGGGAGGATGGACAGCTCTTATGTGGGCATGTTA 495 

i I I I I I I I I i 11 I I I I I I I I I I I I I I I I I I I I I I I i 1 I I II II 
400 CTGGAGCACCGAGACATGGGTGGATGGACAGCTCTCATGTGGGCGTGCTA 449 

• . , 

4 96 CAAAGGCCGTACTGACGTAGTAGAGTTGCTTCTTTCTCATGGTGCCAATC 545 

II I I I I I I II I I I I I 11 III I II I I I I I M I I II I I M I I I I I 

450 CAAAGGCAGGACCGACGTGGTCGAGCTGCTTCTTTCTCATGGTGCCAACC 499 

• ... 

546 CAAGTGTCACTGGTCTGCAGTACAGTGTTTACCCAATCATTTGGGCAGCA 595 

II I I I I I I I II I II I I I I I I I Mill I II 11 I I I I I I I II I I I I i 
500 CAAGTGTCACTGGTTTGCAGTATAGTGTCTACCCAATCATCTGGGCAGCG 549 

• • • a • 

596 GGGAGAGGCCATGCAGATATAGTTCATCTTTTACTGCAAAATGGTGCTAA 645 

II I I I I I II I I I I II I I I I I I M I I I I M I I I I II I I I I I I II 
550 GGCAGAGGCCATGCCGACATAGTGCATCTTCTACTGCAGAATGGTGCGAA 599 

• • • • • 

64 6 AGTCAACTGCTCTGATAAGTATGGAACCACCCCTTTAGTTTGGGCTGCAC 695 

1 I I I I I Mill II Mill II I II II I 11 II II II II II M I 
600 AGTCAATTGCTCCGACAAGTACGGAACAACCCCCTTGGTATGGGCTGCCC 64 9 

• • • 

696 GAAAGGGTCATTTGGAATGTGTGAAACATTTATTGGCCATGGGAGCTGAT 745 

I M M M I M I N II M I II I II II II II M M M M I I I M I I t II 
650 GAAAGGGTCATTTGGAATGTGTGAAGCACTTGTTGGCCATGGGAGCTGAT 699 
• 

74 6 GTGGATCAAGAAGGAGCTAATTCAATGACTGCACTTATTGTGGCAGTGAA 795 

If i I I I I I I I I i I I I i I I I I I I I I i I I I I I I I i I I I I I i I I I I 1 I 
700 GTTGATCAAGAAGGAGCCAATTCGACGACTGCACTTATCGTGGCAGTGAA 749 

• • • • • 

796 AGGAGGTTACACACAGTCAGTAAAAGAAATTTTGAAGAGGAATCCAAATG 845 

I i I I I I 1 I I I I i I I I I I II I I I I I II I I I I II I I I II I I I I I I II 
750 AGGAGGCTACACACAGTCCGTGAAAGAGATTTTGAAAAGGAATCCAAATG 799 

• • • • . 

84 6 TAAACTTAACAGATAAAGATGGAAATACAGCTTTGATGATTGCATCAAAG 895 

i Ml II I I II II I I I I I I I I I I I I I I I i I I I I I I I I I I Ml 
800 TGAACCTAACTGACAAGGATGGAAACACAGCTCTGATGATCGCATCGAAG 849 

• • . • • • 

896 GAGGGACATACGGAGATTGTGCAGGATCTGCTCGACGCTGGAACATATGT 945 

I I I I M M M I M I M I M II M I 11 I II II II I M I I I M I II 
850 GAGGGACATATTGAGATTGTGCAGGACCTGCTGGACGCCGGAACATACGT 899 

■ • - • • 

946 GAACATACCTGACAGGAGTGGGGATACTGTGTTGATTGGCGCTGTCAGAG 995 

M M M M I I II I I M II N M M I I I II I I I M i M II I I II I II I 
900 GAACATACCTGACAGGAGTGGGGATACTGTATTGATCGGCGCTGTGAGAG 94 9 
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Figure 15 (page 3 of 13) 

996 GTGGTCATGTTGAAATTGTTCGAGCGCTTCTCCAAAAATATGCTGATATA 1045 

I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i n I I I I I I I I I I I I I I I I I t I I I I I I I I I I I 

950 GTGGTCATGTTGAAATTGTTCGAGCACTTCTCCAAAAATATGCTGATATA 999 
1046 GACATTAGAGGACAGGATAATAAAACTGCTTTGTATTGGGCTGTTGAGAA 1095 

I I I i I I I I I I I I I I M I I I I I II II I I I I I I I I I I I I I I I II 

1000 GACATTAGAGGGCAGGACAATAAGACCGCGCTGTATTGGGCCGTTGAAAA 1049 

• • ■ • • 

1096 AGGAAATGCAACAATGGTGAGAGATATCTTACAGTGCAATCCTGACACTG 1145 

I I I I I { M I II II II I I II I I I II I I I M I I I I I II I Mill I 
1050 GGGAAATGCGACAATGGTGAGGGATATCCTGCAGTGCAATCCGGACACGG 1099 

• • ■ • • 

1146 AAATATGCACAAAGGATGGTGAAACGCCACTTATAAAGGCTACCAAGATG 1195 

II I I I I I I I I I II I II I I II I II II I II I I I I I II I I I I I i I I I 
1100 AAATATGCACGAAGGATGGTGAGACACCACTGATAAAAGCCACCAAGATG 1149 

• • ■ • • 

1196 AGAAACATTGAAGTGGTGGAGCTGCTGCTAGATAAAGGTGCTAAAGTGTC 1245 

i I I i I i I i I I M II i I I I I I I I li I I i I I I I I II I I I II I i i I I 
1150 AGAAATACTGAAGTGGTGGAGCTGCTGCTGGATAAGGGGGCTAAGGTGTt 1199 

• » ■ « • 

1246 TGCTGTAGATAAGAAAGGAGATACTCCCTTGCATATTGCTATTCGTGGAA 1295 

Mini M 11 II M I I II II Ml I I I 11 II I I II II II I 
1200 TGCTGTGGACAAGAAAGGGGACACTCCCCTGCACGTTGCCATCCGAGGGA 1249 

• • • • • 

1296 GGAGCCGGAAACTGGCAGAACTGCTTTTAAGAAATCCCAAAGATGGGCGA 1345 

II II II M I M II II III II II II II I II II II II I I i I i II 
1250 GGAGCCGGAGACTGGCCGAACTTCTTTTACGGAACCCCAAAGATGGACGG 1299 

134 6 TTACTTTATAGGCCCAACAAAGCAGGCGAGACTCCTTATAATATTGACTG 1395 

II I II II II I I II II II II I II II II II II I I II II M II II M 

1300 TTACTCTACAGACCCAACAAAGCAGGCGAGACTCCCTACJiACATTGACTG 1349 

> • • • • 

1396 TAGCCATCAGAAGAGTATTTTAACTCAAATATTTGGAGCCAGACACTTGT 1445 

Mill II I II II II I I I M I M I M I I M I II II I I I I II M M II 
1350 CAGCCACCAGAAAAGCATTTTAACTCAAATATTTGGAGCCAGACACTTGT 1399 

• • « • 

144 6 CTCCTACTGAAACAGACGGTGACATGCTTGGATATGATTTATATAGCAGT 14 95 

I M II M 11 M I 11 I II II M II I II M II II II I I I I I I II I I I 
1400 CTCCTACGGAAACAGATGGCGACATGCTTGGTTATGATTTGTATAGCAGT 1449 

• • • • • 

1496 GCCCTGGCAGATATTCTCAGTGAGCCTACCATGCAGCCACCCATTTGTGT 1545 

I II II II I I I II I II I I II II I II M I I II I I I II I II I I II II II 
1450 GCCCTGGCCGATATTCTCAGTGAACCCACCATGCAACCACCCATTTGTGT 1499 

• • • a • 

1546 GGGGTTATATGCACAGTGGGGAAGTGGGAAATCTTTCTTACTCAAGAAAC 1595 

Ml M M II M II I M II 11 II II I M I I M II II II II I M M I II 
1500 GGGCTTGTATGCACAGTGGGGAAGTGGGAAGTCTTTCTTACTCAAGAAAC 1549 

• • • * • 
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Figure 15 (page 4 of 13) 

1596 TAGAAGACGA?\ATGAAAACCTTCGCCGGACAACAGATTGAGCCTCTCTTT 1645 

I I I I I I I II I I I I I I i i I II II I 11 Nil I I I I I 11)11 
1550 TAGAAGATGAGATGAAGACGTTTGCAGGACAGCAGACCGAGCCCCTCTTC 1599 

• ■ • • . 

164 6 CAGTTCTCATGGCTCATAGTGTTTCTTACCCTGCTACTTTGTGGAGGGCT 1695 

1 1 1 1 M II II 1 1 II 1 1 II I II I II I II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 r 

1600 CAGTTCTCGTGGCTCATAGTGTTCCTCACCCTGCTGCTGTGTGGCGGGCT 164 9 

• - • • 

1696 TGGTTTATTGTTTGCCTTCACGGTCCACCCAAATCTTGGAATAGCAGTGT 174 5 

11 I I I I I I I M I I I I I I i I I I I I I I i I I I I I I II 
1650 CGGCCTAGTGTTTGCTTTCACGGTTGATACAAACCTCGCCATAGCCATTT 1699 
• 

174 6 CACTGAGCTTCTTGGCTCTCTTATATATATTCTTTATTGTCATTTACTTT 1795 

M I I II III M II I I I I I I II I I I I II I I I II II II I I I I I II 
1700 CATTGAGTTTCCTGGCTCTCATATATATATTCTTCATCGTCATTTACTTC 1749 
• 

17 96 GGTGGACGAAGAGAAGGAGAGAGTTGGAATTGGGCCTGGGTCCTCAGCAC 1845 

M I I I I i 1 I I I 11 I II M M II II II I II II I I I I I I I I I I II 
1750 GGTGGACGGCGGGAAGGAGAGAGTTGGAACTGGGCCTGGGCTCTCAGTAC 1799 

• . • 

184 6 TAGATTGGCAAGACATATTGGATATTTGGAACTCCTCCTTAAATTGATGT 1895 

Mi I I i I I I I I I I Mill II MM MINI I III I I I II I 
1800 CAGACTGGCAAGACACATTGGGTACCTGGAGCTCCTCTTCAAACTGATGT 184 9 

• - • . . 
1896 TTGTGAATCCACCTGAGTTGCCAGAGCAGACTACTAAAGCTTTACCTGTG 1945 

I I M I I I I I I i M M 1 i I I I I I I I I M M I I I I M I I I II i 
1850 TTGTGAACCCACCTGAACTTGCGGAGCAAACTACAAAGGCTTTACCTGTG 1899 

194 6 AGGTTTTTGTTTACAGATTACAATAGACTGTCCAGTGTAGGTGGAGAAAC 1995 

I I I I I I I M I M I I I I I I I I I I I I i I I I I I I I i II I I II I I M I I II 
1900 AGGTTTTTGTTTACAGATTACAATAGACTCTCCAGTGTGGGAGGAGAAAC 1949 
• 

1996 TTCTCTGGCTGAAATGATTGCAACCCTCTCGGATGCTTGTGAAAGAGAGT 2045 

ill I I I II I I I II I I I II I II I I I II I I II i I I II II I I I I I I 
1950 TTCCTTGGCTGAAATGATCGCGACCCTCTCAGATGCTTGTGAGAGAGAAT 1999 
• 

204 6 TTGGCTTTTTGGCAACCAGGCTTTTTCGAGTATTCAAGACTGAAGATACT 2095 

I M I I I II I I I I Ml I II I I I I I I I II III M n M II I II 
2000 TCGGCTTTTTGGCTACCCGTCTTTTTCGAGTCCTCAGGACTGAAGAGTCT 2049 

2096 CAGGGTAAAAAGAAATGGAAAAAAACATGTTGTCTCCCATCTTTTGTCAT 2145 

M I M I 11 I II I II I I I I II II I I I I I I II I I I I I I II I I I I I I I I 
2050 CAGGGTAAAAAGAAATGGAAAAGAACATGCTGCCTCCCCTCTTTTGTCAT 2099 

• • • • • 
2146 CTTCCTTTTTATCATTGGCTGCATTATATCTGGAATTACTCTTCTGGCTA 2195 

IN II I 11 II I II I II II II II I I MM 11 II II II M II II 
2100 CTTTCTTTTTATTGTTGGCTGCATTATTGCTGGGATCACCCTTCTGGCTA 2149 

• • • . . 
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2196 TATTTAGAGTTGACCCAAAGCATCTGACTGTAAATGCTGTCCTCATATCA 2245 

I I I I I I I I I I I I I I i II I I I i I I I I II II I I II I I II I I I II 
2150 TATTCAGAGTTGACCCGAAACATCTGACAGTGAATGCTATCCTCATTTCT 2199 

■ • • ■ • 

224 6 ATCGCATCTGTAGTGGGATTGGCCTTTGTGTTGAACTGTCGTACATGGTG 2295 

I i I I I II I II I II I i I I I I M I I I III I II I I I I I I I I I I I I I 
2200 ATCGCATCTGTCGTGGGGTTGGCCTTCGTGCTCAATTGTCGAACATGGTG 2249 

• • • • • 

2296 GCAAGTGCTGGACTCGCTCCTGAATTCCCAAAGAAAACGCCTCCATAATG 2345 

II i M I I I I I I M I I I i I I I I I I I I i II I I I I I I I I I I I I I II I II 

2250 GCAAGTGCTGGACTCTCTCCTGAATTCTCAGAGAAAACGCCTCCATAGTG 2299 

• • • • • 
2346 CAGCCTCCAAACTGCACAAATTGAAAAGTGAAGGATTCATGAAAGTTCTT 2395 

I II II II III I II I M II I M I I I II M I I II I I I I I I I M II 

2300 CTGCCTCCAAATTACACAAGTTAAAAAGTGAAGGATTCATGAAAGTCCTG 234 9 

• • • • • 
2396 AAATGTGAAGTGGAATTGATGGCCAGGATGGCAZU^AACCATTGACAGCTT 2445 

II II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2350 AAGTGTGAGGTGGAGCTGATGGCCAGGATGGCAAAT^CCATTGACAGCTT 2399 

• • • • • 

244 6 CACTCAGAATCAGACAAGGCTGGTGGTCATCATCGATGGATTAGATGCCT 2495 

I I I I I I i I 11 I I I i I I I I I I I I I I I I I I I I II II I I I I I I I I 
2400 CACTCAAAACCAGACAAGGCTGGTGGTTATCATTGACGGCCTGGATGCCT 244 9 

• • • » • 
2496 GTGAGCAGGACAAAGTCCTTCAGATGCTGGACACTGTCCGAGTTCTGTTT 2545 

I II I I I I I I I I I I I I I II II II I I I I I II I I I II I I I I I I I II I 11 I I 
2450 GCGAGCAGGATAAAGTCCTTCAGATGCTGGACACTGTCCGAGTTCTGTTT 24 99 

■ • • • • 
2546 TCAAAAGGCCCGTTCATTGCCATTTTTGCAAGTGATCCACATATTATCAT 2595 

Hill I I I I I II i i I I I I I I I I I I I I I I I I I II M I I I I I I II I II 
2500 TCAAAGGGCCCTTTTATTGCCATTTTTGCAAGTGATCCGCATATTATCAT 2549 

• • • • • 

2596 AAAGGCAATTAACCAGAACCTCAATAGTGTGCTTCGGGATTCAAATATAA 2645 

I 1 II II II I I I I I I I II 11 I II I I M I I I I I I I II i I I I I I I I I I 
2550 AAAGGCCATCAACCAGAACCTGAATAGTGTGCTTCGTGACTCAAATATAA 2599 

• • « • • 

264 6 ATGGCCATGACTACATGCGCAACATAGTCCACTTGCCTGTGTTCCTTT^T 2695 

I 11 II I I I I I I I I I I I II I I I I I I I Mill II II II i I M I I 
260d ACGGACATGACTACATGCGCAATATAGTTCACTTACCAGTTTTTCTTAAT 2649 

• • ■ • * 
2696 AGTCGTGGACTAAGCAATGCAAGAAAATTTCTCGTAACTTCAGCAACAAA 2745 

I I I I I I II II I I I I I I I i I I I I I I I I II I I I I II I I I I I I I II I I 
2650 AGTCGTGGGCTTAGCAATGCAAGGAAATTCCTTGTAACTTCAGCAACAAA 2699 

• ■ • • • 
2746 TGGAGACGTTCCATGCTCAGATACTACAGGGATACAGGAAGATGCTGACA 2795 

III III II I I I I 1 I I I I I II I I I I I I I I I I I I II I I I I I I 
2700 TGGGGACATTACATGCTCAGACACCACAGGAACACAGGAGGACACTGACA 2749 

• • ■ • • 

42/79 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCtD: <WO 02207B6A2_L> 



wo 02/20786 



PCT/GBOl/03977 



Figure 15 (page 6 of 13) 

2796 GAAGAGTTTCACAGAACAGCCTTGGGGAGATGACAAAACTTGGTAGCAAG 2845 

i ) ) I ) I n N I i ) I I ) I I I I j I i ) I I ) I I I I I I I I I I DIN Mill 
2750 GAAGAGTTTCACAGAACAGCCTTGGGGAGATGACAAAGCTTGGGAGCAAA 2799 

284 6 ACAGCCCTCAATAGACGGGACACTTACCGAAGAAGGCAGATGCAGAGGAC 2895 

II II I I I I I III I I I I II II II I II i I i II I I I I I I I I I M II 
2800 ACGGCTCTCAACAGAAGGGACACTTACCGCAGAAGACAGATGCAGAGAAC 2849 
• 

2896 CATCACTCGCCAGATGTCCTTTGATCTTACAAAACTGCTGGTTACCGAGG 2945 

i M I II I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I II I I I I 
2850 CATCACGAGGCAGATGTCCTTTGATCTCACAAAGCTGCTGGTCACTGAGG 2899 
• 

2946 ACTGGTTCAGTGACATCAGTCCCCAGACCATGAGAAGATTACTTAATATT 2995 

I I 11 I M I I I I I I I I I I I II I I I I I II I I II I I I I I I I I I I I I 
2 900 ATTGGTTCAGTGACATCAGCCCTCAGACCATGAGACGGCTACTCAATATT 2949 
• 

2996 GTTTCTGTGACAGGACGATTACTGAGAGCCAATCAGATTAGTTTCAACTG 3045 

I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I 1 I I I I I I I I II I 
2950 GTTTCTGTGACAGGGCGGCTACTAAGAGCCAATCAGATTACTTTCAACTG 2999 

3046 GGACAGGCTTGCTAGCTGGATCAACCTTACTGAGCAGTGGCCATACCGGA 3095 

N I i I I M I II I I I I I i I I I I I I I I II I I I II I I I I I I I II I II I 
3000 GGACAGGCTAGCCAGCTGGATCAACCTCACCGAGCAGTGGCCGTACCGGA 3049 

3096 CTTCATGGCTCATATTATATTTGGAAGAGACTGAAGGTATTCCAGATCAA 3145 

I M I I I I I III I I I I I I I I I I I I I I I I I I I I I I I II I I I II 
3050 CGTCTTGGCTTATACTATATCTGGAAGAGACTGAAGGTCTCCCTGATCAG 3099 

314 6 ATGACATTAAAAACCATCTACGAAAGAATATCAAAGAATATTCCAACAAC 3195 

I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I 
3100 ATGACCTTAAAGACCATCTATGAGAGAATATCGAAGAATATCCCAACAAC 3149 
• 

3196 TAAGGATGTTGAGCCACTTCTTGAAATTGATGGAGATATAAGAAATTTTG 3245 

IN I I I I I II I I II I I I I I I I I i I I I M I I I II I I II I I I I I I I I I I 
3150 TAAAGATGTTGAGCCGCTTCTTGAAATTGATGGAGATATAAGAAACTTTG 3199 
• 

3246 AAGl'GTTTTTGTCTTCAAGGACCCCAGTTCTTGTGGCTCGAGATGTAAAA 3295 

I " I II II I I M I I I I I II I I I I I I I I I I I I I M { I I I I I t { I I I 
3200 AAGTCTTCTTGTCTTCAAGGACTCCAGTTCTCGTGGCTCGAGATGTAAAG 3249 

3296 GTCTTTTTGCCATGCACTGTAAACCTAGATCCCAAACTACGGGAAATTAT 3345 

- I N I I I I I I Mill II II II I I II M II II II I II I II I M 
3250 ACCTTTTTGCCGTGCACAGTAAACCTGGATCCCAAACTTCGGGAGACCAT 3299 

334 6 TGCAGATGTTCGTGCTGCCAGAGAGCAGATCAGTATTGGAGGACTGGCGT 3395 

I N II II I II II II I M M II M II II I M Mill II II I I 
3300 CGCAGATGTCCGAGCTGCAAGAGAGCAGATCAACATAGGAGGCCTGGCCT 3349 
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3396 ACCCCCCGCTCCCTCTACATGAGGGTCCTCCTAGGGCGCCATCAGGGTAC 34 45 

I I I I I I I I I i I I I I I II II II I II I II 11 I II II I I I t I I 
3350 ATCCCCCGCTCCCTCTGCACGAAGGCCCTCCTCGGCCACCTTCTGGGTAC 3399 

• • • • • 

344 6 AGCCAGCCCCCATCCGTGTGCTCTTCCACGTCCTTCAATGGGCCCTTCGC 3495 

I! I II I I I I I I II I I I I I I I I I II I II I I I I I I I III I 
3400 AGTCAGCCTGCATCTGTCTGCTCTTCAGCATCTTTCAACGGGCCGTTCCC 344 9 

3496 AGGTGGAGTGGTGTCACCACAGCCTCACAGCAGCTATTACAGCGGCATGA 3545 

III II II I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I II 
3450 AGGCGGGGTTGTGTCTCCACAGCCCCACAGCAGTTACTACAGCGGCTTGT 3499 

• •mm* 

3546 CGGGCCCTCAGCATCCCTTCTACAACAGG 3574 

I II II I I I I I I I I I 1 I I I I I I I I I I 
3500 CCGGACCCCAGCACCCCTTCTACAACAGGCCATTCTTTGCCCCATACCTT 3549 



3575 GGGT CAGGCCCAGCCCCAGGCCC 3597 

I I I I I I I I I II I I I II I 

3650 ACTCTGGGTTTAACAAACAGAGACAGGGCAGCCGTCCCTGCCACAGGCAG 3699 

• • • • • 

3598 AGTGGTATTACTGAATTCACTGAATGTGGATGCAGTATGTGAGAAGCTGA 3647 

I I I I I I I I I I I I III I I I I I I I II I I I I I I I I I I I I 
3700 CTCACTGTTACTGAGTTCAATGACCGTGGATGTCGT . TGTGAGAAACTGA 3748 

• • • • • 

3648. Ty^CAAATAGAAGGGCTGGACCAGAGTATGCTGCCTCAGTATTGTACCACG 3697 

III I I I I I I I I I I II I I I M I i I III I I I I I I I I I I II II II 
3749 GACAGATAGAAGGGCTGGACCAGAGCATGATGCCTCAGTACTGCACAACA 3798 

3698 ATCAAAAAGGCAAACATAAATGGCCGTGTGTTAGCTCAGTGTAACATTGA 3747 

I I I I I I I i I I I I I I I I I I I I I I I I I II II i I I I I I I I II I I I I I I 
3799 ATCAAAAAGGCGAACATAAATGGCCGGGTATTGTCTCAGTGTAACATTGA 3848 

• • • • a 

3748 TGAGCTGAAGAAAGAGATGAATATGAATTTTGGAGACTGGCACCTTTTCA 3797 

I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I II UN 
3849 TGAGCTGAAGAAAGAGATGGCTATGAACTTTGGAGACTGGCATCTCTTCA 3898 

37 98 GAAGCACAGTACTAGAAATGAGAAACGCAGAAAGCCACGTGGTCCCTGAA 3847 

I I I I II II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I 
3899 GAAGCATGGTCTTAGAAATGAGAAGTGTGGAAAGCCAGGTGGTCCCTGAA 3948 

• • • • • 

3848 GACCCACGTTTCCTCAGTGAGAGCAGCAGTGGCCCAGCCCCGCACGGTGA 3897 

I I I I I Mill II II III I I I I I I I I II I I II II II II 
3949 GACCCTCGTTTTCTCAATGAAAACTCCAGTGCCCCAGTTCCTCATGGAGA 3998 

• • • • • 
3898 GCCTGCTCGCCGCGCTTCCCACAACGAGCTGCCTCACACCGAGCTCTCCA 3947 

I I I I I II I I I I I I I I I I I I I I I M I I I I I III I I I I I I I I I 
3999 GTCTGCTCGCCGCCCTTCTCACACTGAGCTGCCCCTCACGGAGCTTTCCA 4048 
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3948 GCCAGACGCCCTACACACTCAACTTCAGCTTCGAAGAGCTGAACACGCTT 3997 

I I I I I I I II I II I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I 
4049 GCCAGACTCCATACACCCTGAACTTCAGCTTTGAAGAACTGAACACGCTT 4098 

3998 GGCCTGGATGAAGGTGCCCCTCGTCACAGTAATCTAAGTTGGCAGTCACA 4047 

I II II Mill I t II I I II I I I I M I I II I I I I I I I I I I I I I I I i I I I I 
4099 GGCCTTGATGAGGGTGCCCCTCGTCACAGTAATCTAAGTTGGCAGTCACA 4148 

4048 AACTCGCAGAACCCCAAGTCTTTCGAGTCTCAATTCCCAGGATTCCAGTA 4097 

I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I 
4149 AACTCGCAGAACCCCAAGTCTCTCGAGTCTCAATTCCCAGGACTCCAGTA 4198 

4098 TTGAAATTTCAAAGCTTACTGATAAGGTGCAGGCCGAGTATAGAGATGCC 4147 

II I 11 I II I I II I I I I II I I II I I II I I II II II I M I II I I II II III 
4199 TTGAAATTTCAAAGCTTACTGATAAGGTGCAGGCCGAGTATAGAGACGCC 4248 

4148 TATAGAGAATACATTGCTCAGATGTCCCAGTTAGAAGGGGGCCCCGGGTC 4197 



4249 TATAGAGAGTACATTGCTCAGATGTCCCAGTTAGAAGGGGGCACAGGGTC 4298 
4198 TACAACCATTAGTGGCAGATCTTCTCCACATAGCACATATTACATGGGTC 424 7 



4299 GTCCAC2\ATAAGCGGCAGATCTTCTCCACATAGCACATACTACATAGGCC 434 8 

• • • • k 

4248 AGAGTTCATCAGGGGGCTCTATTCATTCAAACCTAGAGCAAGAAAAGGGG 4297 

MM II M M M M I M M M M I I M M I M MM MM 
434 9 AGAGCTCTTCAGGGGGCTCCATCCATTCTACTCTAGAACAGGAAAGGGGG 4398 

• • • • • 
4298 AAGGATAGTGAACCAAAGCCCGATGATGGGAGGAAGTCCTTTCTAATGAA 4347 

mil I I I I I I I I I I II I I I I I I I I I I I I I III I I I I I I I 
4399 AAGGAGGGTGAGCTAAAGCAGGAGGATGGGCGGAAGTCATTTTTAATGAA 4448 

• • • ■ ■ 

434 8 GAGGGGAGATGTTATCGATTATTCATCATCAGGGGTTTCCACCAACGATG 4397 

M i M I M M I M M M I M M M M M M I M M I M M I 
4449 GAGGGGGGATGTCATAGATTACTCCTCATCAGGGGTGTCCACTAATGAGG 4498 

• » • * • 
4398 CTTCCCCCCTGGATCCTATCACTGAAGAAGATGAAAAATCAGATCAGTCA 444 7 

I II I I I I I I I I II II I I I I I I I I I I I I I I Mill II I I I I I I 
4499 CCTCACCCCTGGACCCCATTACTGAAGAAGATGAGAAATCCGACCAGTCA 454 8 

• • ■ • • 

4448 GGCAGTAAGCTTCTCCCAGGCAAGAAATCTTCCGAAAGGTCAAGCCTCTT 4497 

II I I I I I I I I I I I I I I I I I I I I I I I I I II III II I II II I I I 
4549 GGTAGTAAGCTACTCCCAGGCAAGAAATCCTCAGAACGGCCCAGTCTCTT 4598 

• • " • • « 

4498 CCAGACAGATTTGAAGCTTAAGGGAAGTGGGCTGCGCTATCAAAAACTCC 4547 

M M M M I M M M M M M M MM M M M I M M M I I 
4599 CCAGACAGACTTGAAGCTTAAGGGGGGTGGTCTTCGCTACCAGAAACTGC 4 64 8 

• • • ■ • 
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4548 CAAGTGACGAGGATGAATCTGGCACAGAAGAATCAGATAACACTCCACTG 4597 

i 1 1 1 1 1 II Hill II II II II I i 1 1 1 1 1 1 II I i 1 1 1 I i 1 1 II 

4 649 CCAGTGATGAAGATGAGTCGGGGACAGAAGAGTCAGATAACACCCCACTG 4 698 

• • • . • • 

4598 CTCAAAGATGACAAAGACAGAAAAGCCGAAGGGAAAGTAGAGAGAGTGCC 4 647 

II I I I I I II II II I I II III II I I I II Mil I i I I I I I I i I 
4699 CTCAAAGATGATAAGGACAAAAAGGCTGAAGGCAAAGCAGAGAGAGTGTG 4748 

• • • • • 

4 648 GAAGTCTCCAGAACACAGTGCTGAGCCGATCAGAACCTTCATTAAAGCCA 4697 

I I I i I I I I I I 11 I I I I I I HIM i I M M I I M I M M I M i i 
4749 CAAGTCCCCAGAGCACAGTGCCGAGCCCATCAGAACGTTCATTAAAGCAA 4798 

4698 AAGAGTATTTATCGGATGCGCTCCTTGACAAAAAGGATTCATCGGATTCA 474 7 

I 11 I i I M I I II Mill I M I M II 1 i I I i I I 11 M I II Mill 
47 99 AAGAGTATTTGTCCGATGCCCTCCTTGACAAAAAGGATTCCTCAGATTCG 4848 

• * • • • 

4748 GGAGTGAGATCCAGTGAAAGTTCTCCCAATCACTCTCTGCACAATGAAGT 4797 

M I I II M Mil ill I II M M 11 I I II II I il I II I i I II I I 
484 9 GGAGTGAGGTCCAATGAGAGTTCTCCCAACCACTCTCTCCACAATGAGGC 4898 

4798 GGCGGATGACTCCCAGCTTGAAAAGGCAAATCTCATAGAGCTGGAAGATG 4847 

M II i i M il I i II II II II il 11 i I I M II II M I I il M 11 
4899 AGCAGACGACTCCCAACTGGAAAAGGCAAACCTCATAGAGCTTGAAGATG 494 8 

4848 ACAGTCACAGCGGAAAGCGGGGAATCCCACATAGCCTGAGTGGCCTGCAA 4897 

I i I II II II II I I H II II II I I I II I I II I I I I I II I I i 
4949 AGGGCCACAGTGGGAAGCGGGGGATGCCACACAGTCTGAGTGGCTTGCAA 4998 

■ • • • • 

4898 GATCCAATTATAGCTCGGATGTCCATTTGTTCAGAAGACAAGAAAAGCCC 4947 

i M I M il I II II II II I I II II M I I II II II I I II I M I I II I I I I 
4999 GATCCAATTATAGCTCGGATGTCCATTTGCTCGGAAGACAAGAAAAGCCC 5048 

• • • • • 

4948 TTCCGAATGCAGCTTGATAGCCAGCAGCCCTGAAGAAAACTGGCCTGCAT 4997 

i I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

504 9 TTCCGAGTGCAGCCTGATTGCTAGCAGCCCTGAAGAAAGCTGGCCTGCGT 5098 

• • • • • 
4998 GCCAGAAAGCCTACAACCTGAACCGAACTCCCAGCACCGTGACTCTGAAC 5047 

II II I M I II I M il i I I i M M M I il I M M I M I I I M II M 
5099 GCCAGAAGGCCTACAATCTAAACCGAACGCCCAGCACTGTGACTCTGAAC 514 8 

• « • • • 

5048 AACAATAGTGCTCCAGCCAACAGAGCCAATCAAAATTTCGATGAGATGGA 5097 

Mill I ill II I M I M II M M M il II I 11 I i i M II I II 
5149 AACAACACTGCACCCACCAACAGAGCCAATCAGJ^ACTTCGATGAGATAGA 5198 

> • • • • 

5098 GGGAATTAGGGAGACTTCTCAAGTCATTTTGAGGCCTAGTTCCAGTCCCA 5147 

i i I I il I I 11 II I I 11 11 II I i M il M il I i I 1 I I I M 1 M 
5199 GGGAATCAGGGAGACGTCCCAGGTCATTCTGAGGCCTGGCCCCAGTCCCA 5248 

• « • • ■ 
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5148 ACCCAACCACTATTCAGAATGAGAATCTAAAAAGCATGACACATAAGCGA 5197 

M M I I I II I I I II I I I I I I II II I I I II I I II I I I I I I 
5249 ACCCAACTGCTGTGCAGAATGAAAACCTGAAGAGCATGGCGCACAAGCGA 5298 

5198 AGCCAACGTTCAAGTTACACAAGGCTCTCCAAAGATCCTCCGGAGCTCCA 5247 

i I N I II II I I I I I I I t II I I I II f i I II I I i I I i I I II 11 
5299 AGCCAGCGCTCCAGTTACACAAGGCTCTCGAAAGATGCATCTGAGCTACA 5348 

5248 TGCAGCAGCCTCTTCTGAGAGeACAGGCTTTGGAGAAGAAAGAGAAAGCA 5297 

I I I I 11 I I I I i I I I I I I I I I I I I I I I I II I I I I I INI! I I 11 
534 9 T. . .GCAGCCTCCTCGGAGAGCACAGGCTTTGGAGAAGAGAGAGAGAGCA 5395 

• • • - . 

5298 TTCTTtgagaaaaacaagcaaaggagaagagtgttactgtacccttatga 5347 
I M I II I I I I I I II I II III 

5396 TTCTCtgag gaaggagaag tga 5417 

• 

5348 cagaattgtcctggattttgactccatccacgcccatcacctttctacat 5397 

I I i M I I I I I I I I II I I II I I II I 
5418 aaagagtgtgacagactttgagttgaccattcctcattactcatcatgtt 54 67 

5398 tttgc. tgacagataactaaccgatgatgaggccgaggtaaaagagacat 5446 
I I M I I I I I i I I I I I I I I I I I I III II 

54 68 tatgcatcacagacacctgataggtggtcaggcactggaagcag 5511 

• 

5447 ctgcagtgtgacagaagggagcatgagaagcatggctcaccagccagcct 5496 

I I I I I I I I I I I I I i I I I I II II II I 
^^12 -acaggaaagcctga-aagcacag. tcaccagtcatttt 5547 

• • • • • 

54 97 ctgtggtctttgtaattagaagc. . .ttcagaactcactaatactactgt 5543 

I I ill II II III III Mill! Mill 
5548 c.attgtcaccgtcatcccaggcactttaaagtctcactgccactattac 5596 

* * • • • 

554 4 acctttcattggcgcattaccccataaaactttttgagacgaggtgagat 5593 
I M I I M I I I I I I I III I II II III 

5597 acctttc.ctggcacacagctaggcagagc. . gatgagagcagg 5637 

• 

5594 ctgagtataaagataggtcagaagt attttaaagggcttaat 5635 

nil I III llllllll Mil I II lllllllllll 

5638 ctgaagacaaa.ataggtcaaaagtgtatttaaagttgaaagggcttaa. 5685 

5636 gtgccaaaaagaaaaaaagctagagaccctttttgcaaacatttggtgac 5685 

I I i I II I II II I I I I I I I I III II I I 11 II 
5686 • .atcacaaggaaaaaaacttagagatetcctttataaactattcatgta 5733 
• 

568 6 cacacatttgagggaagacgtggcgttaggtgaagcagaagcaaaccctg 5735 

i I I i II I i i t I I I II I I I I I I I I I I I I I I I I I II I 

5734 c, . -catttgagggaagatacagtgttagatgaagcagaaaccaatttta 5780 
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5736 ctcttaggggctcacctaggtg agtgcacagcctgtg 5772 

I I I I i I I I I I I I I I II I I 

5781 ttgttagtgtttaatcttggtgcagatttataagttttagagtagcccag 5830 

5773 acgctacagggagaggctgagtaaaccgagatccagcgttctgtatggca 5822 

I I I I I I I I I I I M i I I I I I I I I I I I I I I I I 
5831 agactaaaagtgaatacttagcaaatggatagccagtgttctatatag. . 5878 

5823 ggggtattgcttatcacagaggttctgaag. . . . agtaggaagtacataa 58 68 

I I I I I I I I I MINI I I I I I I I I I I I I I I I I 
5879 gaatcattgctttt . - cagagggcttaaagtttaagtagaaaatatatac 5926 

5869 tgaagagggctttaaaaattgccaacaaagtgagtcaccagggctggcag 5918 

I I I I I I I I I I I I I II I I i M I I I I I I I I I I II 
5927 tcaagaaggcgataaaagctgatgagaaagtgagttagcagaacccaaag 597 6 

5919 tagtgtgacggggctgtcctgagctgttaggagagtagatgcggggaggg 5968 

III I I II i I Ml II II I I I I M I I I I I I I I I 

5977 ccgtg . • . ctgggccgcggtgactcattagcagag . . gaggaggggaggg 6021 

5969 ctggtgacctccgtgggtttatatgtcggaaactcttctctccaaatccc 6018 

I II I I I I I I I I I -I I I I I M I I II 
6022 c agtatattcctgggatac. . . tctctccaga* .cc 6052 

6019 aggcctggcttccagcaccatccagctgtgcccaagaagccaccctggtc 6068 

I I 1 I I I I I I I II I I I I I I I I I I I I I I II M I I III 
6053 cagcctggcttctgacatcatccacctgtgccctcaaaaccgtct:1:agtc 6102 

6069 tgttctccaactcttttaaatggtgcccaacttttctaagtgagcttagc 6118 

II II II I II II I I I I I I I III i II I I I II I i 

6103 tgttctgcaactc. - ttaagtg acatacctaactcagctcatg 6143 

■ • • • • 

6119 aatgagaagaaaaaa 6133 

I II I I I I I II 

• 6144 gctaaggaaaaaaaattaaagttgtcctggtgattaaactctggacctcc 6193 

• • • • V 

6134 aaacatgaat 6143 

II II M I 

6194 cacatctaagtcccgagttgacaaactgcatccccagcccaaacatctgt 6243 

• • * • • 
6144 tctt tttctggaaaatcagggagacatgggtaataataggtacta 6188 

II I I I I I I I 11 I II II I I I I I II 
6244 tcttccagagatctggaaagacatgggcataagagt oca 6282 

• • • • • 
6189 ataaatatttatagatgagtgaatgaggaaataattacatcaaaaaggt . 6237 

Nil Ml II II I I I II I I I I I I I I I Ml 
6283 gtaaactctta. . . atgaaggtgtcgggaagtgtttcccttaaaggggtt 6329 
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6238 cagtgacaattgataaatgacaaggaaatatttaattaggtaaaa 6282 

I I I I I I M I I I I I I I I I - I I i I I I I I I I I I 

6330 agtgacagtgacactagacaagtgac agaatttagtttgctgaca 6374 

. • • • • • 

6283 ctaaatcattgctctctatactaggatagactttatctacttcatctgtt 6332 

I I I M I I i I I III I I III I I I I N I I II I 

6375 ttaaatcactgccttctgggccgcaatag. .ttta. .... .gtatctgtt 6416 

* • 

6333 cctaagtcagcatgttagttctggggaaggatcataagaaaggaaatact 6382 

I II I I N I I I II I I I I I I II II i UN I I I I 

6417 cctaaatcagcatttgagttctgcgggggttacaaaatggaggca. .tgg 64 64 

• • . • • 
6383 ttttaaaaaaaaatttggaaacatgtaacaaagcaagggtaaaatatata 6432 

II M II II I II I I i II M I I I II II II III 

64 65 ttttaagaaggaatttgaaagcatccagggaagcaagaacagagt 6509 

• • . • 
6433 tatatatctatataagtgctgtgactgtaaaagtgtactttccattaatt 6482 

I I I I I I I I I II II II I 11 II I II 

6510 cggagctggtgtgaccatga cacttcccatt . att 6543 

• • - ♦ 

6483 attagccgagttaagagaatggtc acattgaagtactgtgtggact 6528 

I I I II II i i I I I I I I I. I I I I I I I I I I I I I I I M 

6544 gttagcaaagctaggagaatgaccacatacatttaagtatcatgtaggct 6593 

• - • • • 
6529 aga.aatgtaccctgtcatcatgcaatgaaatattgttatcgttttaaca 6577 

N I 11 I I I I I I I I I I I i i I I I I I i i I I i I 

6594 agagcgtgcctgtcatcatcatgcattgagaccgt .ttgtcctttgatta 6642 

■ • • • • 

6578 tagctcatttatgtagaatgaattctggtggtttaccccaagtcacagtt 6627 

Ml M I II I I I I I I I I I i III II I I I I i 

6643 tagtgtgtttatgtag-accagtcctagcagtatgtcctgagccatgggc 6691 

• • • • • 
6628 aggacggtagatggtgagatcgcagatgcgctattatctagattcagtgt 6677 

MM I II II II I III I I I II II I I II 

6692 aggattgtagatggcgaggtggc aggtttagtgc 6725 

6678 tacattttcgatgtttatcactcagtgggtttttattaatatgctgatta 6727 
I M Mill I I I I I I I I I II 

672 6 t. .cgttgagatgtccatctctgactggacctgt 6757 

• • . . 

6728 agttatttactgggccagtcattgtgctaaatagttgctcttttgtgttt 6777 

M M II II I M I I II I I III 
6758 .gtggtttacttg tagctgctc cgtgaaa 6785 

• 

6778 cattgccttgatgtttgagtgtaatctagcattttaatacagtgtttatt 6827 

M I I I 11 II I I I I 11 I II I I I I I I i I I I I II II I I I I I Ml* 
67 8 6 cactgccttgctgtttgagtgtaattgagcattttcatgaagtgtatatg 6835 

• • • . . 
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6828 ttgcatgatctttaacaaatgttttaagcaattttaaaaaggcaggatgt 6877 

I I I I I I I I I I I I I I I I I . I I i I I I I I i I I I I I I I I I I I 
6836 ttgcatgatctttaacaggtatttttag.acattaagaacagcatgatat 6884 

• • • • • 

6878 tattgacattatacactgaagtcttaacattttaacatttatagtgctta 6927 

I I 1 I I I I I I I I I i I I 
6885 tactgaca tatgcattagtag 6905 

• • • • « 
6928 tttgcaaaattgtataattaggaattatttcagagacaatgt . tttcttt 6976 

I I I I I II i J I I I I I I I I I I I I I I I I I II II 
6906 tatgcagtatcatgtaatggg. • . . tacttcagaggcagtttggtttctt 6951 

■ • ■ • • 

6977 ttcaggtgagtagttgccgcgtaatatcattggagtacattctttatact 7026 

I I I II II II t II M II I 11111111111111111 
6952 tccaggtgagtagttatagctcacagtcatgggaatcctttattcacact 7001 

■ • • ■ * 
7027 gtttgtgaaattaatactagcatattaagtgtacaaatagatttagaaaa 707 6 

111111111 II II I I II 11 III I I I I I M 
7002 gtctgtgaagtcgatgctggagtgccaaatgcacacggagactagg. .aa 704 9> 

7077 caataaaaaattgcatgc 7094 

1111)11 I I i I II I II 
7050 caataaagacttgcatgc 7067 
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1 gtcctctgac aaggtctgcg gccctgtgag ctggctctac agagccgagg gcggagtcct 
61 cagccaggcc gagatacggt gaattaaagt cggcaataga tttctgtgaa aatgtcagtt 
121 cttatatcac agagtgtgat aaattatgtg gaagaagaga atattcctgc tctgaaagct 
181 cttcttgaga agtgcaaaga tgtagatgag aggaacgagt gtggccagac gccgctgatg 
241 ctggctgcgg agcagggcaa tgtggaaata gtgaaagaac tgttaaagaa tggagccaac 
301 tgcaatctgg aagatctgga taactggact gcccttatat cggcatctaa ggaggggcac 
361 atccacattg tggaggagct gctgaagagc ggggccagtc tggagcaccg agacatgggt 
421 ggatggacag ctctcatgtg ggcgtgctac aaaggcagga ccgacgtggt cgagctgctt 
481 ctttctcatg gtgccaaccc aagtgtcact ggtttgtata gtgtctaccc aatcatctgg 
541 gcagcgggca gaggccatgc cgacatagtg catcttctac tgcagaatgg tgcgaaagtc 
601 aattgctccg acaagtacgg aacaaccccc ttggtatggg ctgcccgaaa gggtcatttg 
661 gaatgtgtga agcacttgtt ggccatggga gctgatgttg atcaagaagg agccaattcg 
721 atgactgcac ttatcgtggc agtgaaagga ggctacacac agtccgtgaa agagattttg 
781 aaaaggaatc caaatgtgaa cctaactgac aaggatggaa acacagctct gatgatcgca 
841 tcgaaggagg gacatattga gattgtgcag gacctgctgg acgccggaac atacgtgaac 
901 atacctgaca ggagtgggga tactgtattg atcggcgctg tgagaggtgg tcatgttgaa 
961 attgttcgag cacttctcca aaaatatgct gatatagaca ttagagggca ggacaataag 
1021 accgcgctgt attgggccgt tgaaaaggga aatgcgacaa tggtgaggga tatcctgcag 
1081 tgcaatccgg acacggaaat atgcacgaag gatggtgaga caccactgat aaaagccacc 
1141 aagatgagaa atattgaagt ggtggagctg ctgctggata agggggctaa ggtgtctgct 
1201 gtggacaaga aaggggacac tcccctgcac gttgccatcc gagggaggag ccggagactg 
1261 gccgaacttc ttttacggaa ccccaaagat ggacggttac tctacagacc caacaaagca 
1321 ggcgagactc cctacaacat tgactgcagc caccagaaaa gcattttaac tcaaatattt 
1381 ggagccagac acttgtctcc tacggaaaca gatggcgaca tgcttggtta tgatttgtat 
1441 agcagtgccc tggccgatat tctcagtgaa cccaccatgc aaccacccat ttgtgtgggc 
1501 ttgtatgcac agtggggaag tgggaagtct ttcttactca agaaactaga agatgagatg 
1561 aagacgtttg caggacagca gaccgagccc ctcttccagt tctcgtggct catagtgttc 
1621 ctcaccctgc tgctgtgtgg cgggctcggc ctagtgtttg ctttcacggt tgatacaaac 
1681 ctcgccatag ccatttcatt gagtttcctg gctctcatat atatattctt catcgtcatt 
17 41 tacttcggtg gacggcggga aggagagagt tggaactggg cctgggctct cagtaccaga 
1801 ctggcaagac acattgggta cctggagctc ctcttcaaac tgatgtttgt gaacccacct 
1861 gaactgccgg agcaaactac aaaggcttta cctgtgaggt ttttgtttac agattacaat 
1921 agactctcca gtgtgggagg agaaacttcc ttggctgaaa tgatcgcgac cctctcagat 
1981 gcttgtgaga gagaattcgg ctttttggct acccgtcttt ttcgagtctt caggactgaa 
2041 gagtctcagg gtaaaaagaa atggaaaaaa acatgctgcc tcccctcttt tgtcatcttt 
2101 ctttttattg ttggctgcat tattgctggg atcacccttc tggctatatt cagagttgac 
2161 ccgaaacatc tgacagtgaa tgctatcctc atttctatcg catctgtcgt ggggttggcc 
2221 ttcgtgctca attgtcgaac atggtggcaa gtgctggact ctctcctgaa ttctcagaga 
2281 aaacgcctcc atagtgctgc ctccaaatta cacaagttaa aaagtgaagg attcatgaaa 
2341 gtcctgaagt gtgaggtgga gctgatggcc aggatggcaa aaaccattga cagcttcact 
2401 caaaaccaga caaggctggt ggttatcatt gacggcctgg atgcctgcga gcaggataaa 
24 61 gtccttcaga tgctggacac tgtccgagtt ctgttttcaa agggcccttt tattgccatt 
2521 tttgcaagtg atccgcatat tatcataaag gccatcaacc agaacctgaa tagtgtgctt 
2581 cgtgactcaa atataaacgg acatgactac atgcgcaata tagttcactt accagttttt 
2641 cttaatagtc gtgggcttag caatgcaagg aaattccttg taacttcagc aacaaatggg 
2701 gacattacat gctcagacac cacaggaaca caggaggaca ctgacagaag agtttcacag 
27 61 aacagccttg gggagatgac aaagcttggg agcaaaacgg ctctcaacag aagggacact 
2821 taccgcagaa gacagatgca gagaaccatc acgaggcaga tgtcctttga tctcacaaag 
2881 ctgctggtca ctgaggattg gttcagtgac atcagccctc agaccatgag acggctactc 
2941 aatattgttt ctgtgacagg gcggctacta agagccaatc agattacttt caactgggac 
3001 aggctagcca gctggatcaa cctcaccgag cagtggccgt accggacgtc ttggcttata 
3061 ctatatctgg aagagactga aggtctccct gatcagatga ccttaaagac catctatgag 
3121 agaatatcga agaatatccc aacaactaaa gatgttgagc cgcttcttga aattgatgga 
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3181 gatataagaa actttgaagt cttcttgtct tcaaggactc cagttctcgt ggctcgagat 
3241 gtaaagacct ttttgccgtg cacagtaaac ctggatccca aacttcggga gatcatcgca 
3301 gatgtccgag ctgcaagaga gcagatcaac ataggaggcc tggcctatcc cccgctccct 
3361 ctgcacgaag gccctcctcg gccaccttct gggtacagtc agcctgcatc tgtctgctct 
3421 tcagcatctt tcaacgggcc gttcccaggc ggggttgtgt ctccacagcc ccacagcagt 
3481 tactacagcg gcttgtccgg accccagcac cccttctaca acaggccatt ctttgcccca 
3541 tacctttaca cgccaaggta ttaccctggc ggttcccaac atctcatctc acgttcatca 
3601 gtaaaaacga gtttgcccag agatcagaac aatggcctac cgtgtgactc tgggtttaac 
3661 aaacagagac aggcagccgt ccctgccaca ggcagctcac tgttactgag ttcaatgacc 
3721 gtggatgtcg tatgtgagaa actgagacag atagaagggc tggaccagag catgatgcct 
3781 cagtactgca caacaatcaa aaaggcgaac ataaatggcc gggtattgtc tcagtgtaac 
3841 attgatgagc tgaagaaaga gatggctatg aactttggag actggcatct cttcagaagc 
3901 atggtcttag aaatgagaag tgtggaaagc caggtggtcc ctgaagaccc tcgttttctc 
3961 aatgaaaact ccagtgcccc agttcctcat ggagagtctg ctcgccgctc ttctcacact 
4021 gagctgcccc tcacggagct ttccagccag actccataca ccctgaactt cagctttgaa 
4 081 gaactgaaca cgcttggcct tgatgaggga gcccctaggc acagtaacct gagttggcag 
4141 tcacaaactc gcagaacccc aagtctctcg agtctcaatt cccaggactc cagtattgaa 
4201 atttcaaagc ttactgataa ggtgcaggcc gagtatagag acgcctatag agagtacatt 
4 261 gctcagatgt cccagttaga agggggcaca gggtcgtcca caataagcgg cagatcttct 
4321 ccacatagca catactacat aggccagagc tcttcagggg gctccatcca ttctactcta 
4381 gaacaggaaa gggggaagga gggtgagcta aagcaggagg atgggcggaa gtcattttta 
4441 atgaagaggg gggatgtcat agattactcc tcatcagggg tgtccactaa tgaggcctca 
4501 cccctggacc ccattactga agaagatgag aaatccgacc agtcaggtag taagctactc 
4561 ccaggcaaga aatcctcaga acggcccagt ctcttccaga cagacttgaa gcttaagggg 

4 621 ggtggtcttc gctaccagaa actgcccagt gatgaagatg agtcggggac agaagagtca 
4681 gataacaccc cactgctcaa agatgataag gacaaaaagg ctgaaggcaa agcagagaga 
4741 gtgtgcaagt ccccagagca cagtgccgag cccatcagaa cgttcattaa agcaaaagag 
4801 tatttgtccg atgccctcct tgacaaaaag gattcctcag attcgggagt gaggtccaat 
4861 gagagttctc ccaaccactc tctccacaat gaggcagcag acgactccca actggaaaag 
4921 gcaaacctca tagagc.ttga agatgagggc cacagtggga agcgggggat gccacacagt 
4981 ctgagtggct tgcaagatcc aattatagct cggatgtcca tttgctcgga agacaagaaa 
5041 agcccttccg agtgcagcct gattgctagc agccctgaag aaagctggcc tgcgtgccag 
5101 aaggcctaca atctaaaccg aacgcccagc actgtgactc tgaacaacaa cactgcaccc 
5161 accaacagag ccaatcagaa cttcgatgag atagagggaa tcagggagac gtcccaggtc 
5221 attctgaggc ctggccccag tcccaaccca actgctgtgc agaatgaaaa cctgaagagc 
5281 atggcgcaca agcgaagcca gcgctccagt tacacaaggc tctcgaaaga tgcatctgag 
5341 ctacatgcag cctcctcgga gagcacaggc tttggagaag agagagagag cattctctga 
5401 tgaggaagga gaagtgaaaa gagtgtgaca gactttgagt tgaccattcc tcattactca 
54 61 tcatgtttat gcatcacaga cacctgatag gtggtcaggc actggaagca gacaggaaag 
5521 cctgaaagca cagtcaccag tcattttcat tgtcaccgtc atcccaggca ctttaaagtc 
5581 tcactgccac tattacacct ttcctggcac acagctaggc agagcgatga gagcaggctg 

5 641 aagacaaaat aggtcaaaag tgtatttaaa gttgaaaggg cttaaatcac aaggaaaaaa 
5701 acttagagat ctcctttata aactattcat gtaccatttg agggaagata cagtgttaga 
5761 tgaagcagaa accaatttta ttgttagtgt ttaatcttgg tgcagattta taagttttag 
5821 agtagcccag agactaaaag tgaatactta gcaaatggat agccagtgtt ctatatagga 
5881 atcattgctt ttcagagggc ttaaagttta agtagaaaat atatactcaa gaaggcgata 
5941 aaagctgatg agaaagtgag ttagcagaac ccaaagccgt gctgggccgc ggtgactcat 
6001 tagcagagga ggaggggagg gcagtatatt cctgggatac tctctccaga cccagcctgg 
6061 cttctgacat catccacctg tgccctcaaa accgtcttag tctgttctgc aactcttaag 
6121 tgacatacct aactcagctc atggctaagg aaaaaaaatt aaagttgtcc tggtgattaa 
6181 actctggacc tcccacatct aagtcccgag ttgacaaact gcatccccag cccaaacatc 
6241 tgttcttcca gagatctgga aagacatggg cataagagtc cagtaaactc ttaatgaagg 
6301 tgtcgggaag tgtttccctt aaaggggtta gtgacagtga cactagacaa gtgacagaat 
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6361 ttagtttgct 
6421 aatcagcatt 
6481 gaaagcatcc 
6541 attgttagca 
6601 gcctgtcatc 
6661 cagtcctagc 
6721 agtgctcgtt 

67 81 tgaaacactg 

68 41 tgatctttaa 
6901 agtagtatgc 
6961 gtagttatag 
7021 gagtgccaaa 
7081 ggagttttat 



gacattaaat 
tgagttctgc 
agggaagcaa 
aagctaggag 
atcatgcatt 
agtatgtcct 
gagatgtcca 
ccttgctgtt 
caggtatttt 
agtatcatgt 
ctcacagtca 
tgcacacgga 
tcaatagagc 



cactgccttc 
gggggttaca 
gaacagagtc 
aatgaccaca 
gagaccgttt 
gagccatggg 
tctctgactg 
tgagtgtaat 
tagacattaa 
aatgggtact 
tgggaatcct 
gactaggaac 
aatgataatt 



tgggccgcaa 
aaatggaggc 
ggagctggtg 
tacatttaag 
gtcctttgat 
caggattgta 
gacctgtgtg 
tgagcatttt 
gaacagcatg 
tcagaggcag 
ttattcacac 
aataaagact 
cccattctga 



tagtttagta 

atggttttaa: 

tgaccatgac 

tatcatgtag 

tatagtgtgt 

gatggcgagg 

gtttacttgt 

catgaagtgt 

atattactga 

tttggtttct 

tgtctgtgaa 

tgcatgccat 

ttaaaaatta 



tctgttcctc 
gaaggaattt 
acttcccatt 
gctagagcgt 
ttatgtagac 
tggcaggttt 
agctgctccc 
atatgttgcc 
catatgcatt 
ttccaggtgc 
gtcgatgctc 
tctgactcat 
aaacacggaa 
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CGAP UniGene OMIM 
SAGE Gene to Tag Mapping 




UniGene cluster iH -19873 | [Homo sapiens 



PubMed 



Entrez 



BLAST 



Hs.9873 : likely homolog of rat kinase D-interacting substance of 220 kDa; KIAA1250 
protein 

SAGE library data and reliable tag siinmiary : 
Reliable tags found in SAGE libraries: 

CTAAAAAAAA 





Tags per 


Tag 




Library name 


million 


counts Total tags 


SAGE Duke 1273 


51 


2 


38836 


SAGE Duke thalamus 


246 '^mm^ 


6 


24371 


SAGE Duke HI 020 


114 


6 


52371 


SAGE Duke BB542 normal 




12 


40909 


cerebellum 


SAGE 293-CTRL 


46 -^a- 


2 


43442 


SAGEHCT116 


165 


10 


60322 


SAGE Caco 2 


487 mm-. 


30 


61601 


SAGE Chen LNCaP 


176 -4^1^ 


11 


62267 


SAGE Chen LNCaP no-DHT 


61 '^fe- 


4 


64631 


SAGE Chen Normal Pr 


105 -^^^ 


7 


66193 


SAGE Chen Tumor Pr 


204 


14 


68384 


SAGECAPANl 


26 


1 


37926 


SAGE CAPAN2 


129 


3 


23222 


SAGEHS766T 


95 


1 


10467 


SAGE Panel 


40 ^^M' 


1 


24879 


SAGE H126 


30 


1 


32420 


SAGE Duke H54 lacZ 


327 aiii^ 


22 


67101 


SAGE Duke H54 EGFRvHI 


454 mm 


26 


57164 


SAGE Duke H392 


139 


8 


57529 


SAGE Duke GBM HI 11 0 


870 mm 


61 


70061 


SAGE SW837 


409 


25 


60986 


SAGERKO 


249 


13 


52064 


SAGE CPDR LNCaP-C 


240 -^^^ 


10 


41590 


SAGE CPDR LNCaP-T 


339 


15 


44122 


SAGE 293-IND 


163 ''^rr 


4 


24481 
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SAGE PR3 1 7 normal 
prosiaie 


858 mm-- 


51 


59419 


o Avjii r^Kj 1 / prostate lUinor 






no 


SAGE pooled GBM 


355 


22 


61841 


SAGE BB542 whitematter 


379 


36 


94806 


SAGE NHAfSthV 


344 


18 


52196 


SAGE nonnal pool(6iti) 


174 ^^m^ 


11 


63064 


SAGE NCI 


39 '^^^ 


2 


50115 


SAGENC2 


60 


3 


49552 


SAGE Pane 91-16113 


265 ^m^^' 


9 


33941 


SAGE Pane 96-6252 


335 ^^^^ 


12 


35745 


SAGE OV1063-3 


179 


7 


38938 


SAGE Tul02 


121 


7 


57636 


SAGETU98 


81 <(^^ 


4 


49005 


SAGE Duke irihh-1 


164 "^^4 


8 


48488 


SAGE SciencePark MCF7 
control 3h 


169 


1 


5903 


SAGE SciencePark MCF7 
Control Oh 


278 


17 


61079 


SAGE SciencePark MCF7 
estradiol 3h 




o 




SAGE SciencePark MCF7 
estradiol lOh 


82 -^^^^ 


5 


60435 


SAGE lacZ 


161 ^s^fe^ 


3 


18528 


SAGE 95-347 


89 ^JS^-j: 


6 


67240 


SAGE 95-259 


152 


6 


39473 


SAGE 95-260 


132 


6 


45179 


SAGE 95-348 


429 ^^^H^ 


26 


60484 


SAGE PedGBMl 062 


1868 ^(Bte 


112 


59935 


SAGE Duke H341 


112 


5 


44563 


SAGE HOSE 4 


433 *mm 


21 


48413 


SAGE ES2-1 


31 


1 


31502 


SAGE OVP-5 


462 >«gSi^ 


5 


10802 


SAGE Medullo 3871 


392. «SB^ 


17 


43274 




1477 tfHttb ° 


1.? 




SAGE normal prostate 


684 


9 


13148 


SAGE LN-1 


525 i^ii!* 


12 


22835 


SAGEPERITO-13 


446 'Sig^' 


24 


53728 


SAGE MousePS PGCP 


114 --i^*- 


7 


61240 


SAGE Meso-12 


685 ^im^ 


24 


35032 


SAGEOC14 


400 Mi> 
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SAGE LNCaP 


353 -^mm-- 


8 


22637 


SAGE OVT-6 


188 <^ 


8 


42336 


SAGEHn26 


514 ^mm 


9 


17501 


SAGE OVT-7 


382 


21 


54914 


SAGE HMEC-B41 


709 mm 


1 


1410 


SAGEMDA453 


211 


4 


18924 


SAGE SKBR3 


735 ^mm 


6 


8153 


SAGE Duke HMVEC 


228 mm.^ 


12 


52532 


SAGE Duke 
HMVEC+VEGF 




KJ 




SAGE mammarv epithelium 


467 


23 


49167 


SAGE DCIS 


242 


10 


41230 


SAGE Duke 757 


358 ^ms^ 


7 


19503 


SAGE normal cerebellum 


254 


13 


51135 


SAGE OVT-8 


655 ^ms^ 


22 


33575 


SAGE Duke 40N 


140 


1 


7142 


SAGE Duke 48N 


165 "^J^^f 


2 


12091 


SAGE MLl 0-10 


228 mm^ 


13 


56943 


SAGE Duke H247 normal 


363 


22 


60543 


SAGE Duke H247 Hvpoxia 


347 ^^m^^ 


25 


71937 


SAGE Duke-H988 


285 ^m^ 


8 


28015 


SAGE Duke post crisis 
fibroblasts 


180 


4 


22207 


SAGE Duke prec^sis 
fibroblasts 




1 
1 


o/y 1 


SAGE DCIS 2 


242 


7 


28888 


SAGE Br N 


213 f^m^' 


8 


37558 


SAGEA+ 


130 


4 


30551 


SAGEIOSE29-11 


226 'MSi^ 


11 


48498 


SAGE Duke H1043 


195 


15 
Tag 


76673 


Library name 


million 


counts 


Total tags 


SAGE Caco 2 


16 ^^m-^ 


1 


61601 


SAGECAPANl 


26 ^^3^' 


1 


37926 


SAGE BB542 whitematter 


21 


2 


94806 


SAGENC2 


20 


1. 


49552 


SAGE 95-348 


16 


1 


60484 


SAGE HOSE 4 


20 


1 


48413 


SAGEPERITO-13 


18 


1 


53728 


SAGEOC14 


57 


1 


17476 


SAGE Dtike 


17 -m?^ 






HMVEC+VEGF 


1 


57928 
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GCTCACCAGC 



SAGE rnammarv epithelium 


20 -'SSP 


1 


49167 


SAGE normal cerebellum 


19 --a® 


1 


51135 


SAGE DCIS 2 


34 <^ 




28888 


- ■ 

Library name 


Tags p er 


Tag 




million 


counts 


Total tags 


SAGE Caco 2 


16 v^l^ 


1 


61601 


SAGE Chen Tumor Pr 


14 


1 


68384 


SAGE Duke H54 EGFRvm 


17 -H^r 


. 1 


57164 


SAGE Duke H392 


17 -i©®** 


1 


57529 


SAGE pooled GBM 


16 'a^S 


1 


61841 


SAGE normal pool(6th) 


31 ^i^* 


2 


63064 


SAGE Duke mhh-1 


.20 


1 


48488 


SAGEOVT-7 


18 «^ 


1 


54914 


SAGE Duke H247 normal 


16 «!3S( 


1 


60543 


SAGE Duke H247 Hvpoxia 


27 


2 


71937 


SAGE Duke H1043 


26 


2 


76673 



Number of SAGE libraries: 95 

Total tags in all SAGE libraries: 3 888724 

Summary of tags fovmd in this cluster : 

Cid- 

>tag 

Tag frequency Getseqs 

cDNA: well-characterized 

CTAAAAAAAA 1/4 



CTGCTTCCCT 



GCTCACCAGC 



1/4 
2/4 



NLM 



NIH 



j SAGEnet | GGEG | NCBI Help 



Otsciaimer 
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1 gtcctctgacaaggtctgcggccctgtgagctggctctacagagccgagg 50 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
1 gtcctctgacaaggtctgcggccctgtgagctggctctacagagccgagg 50 

• . . • _ . • • • 

51 gcggagtcctcagccaggccgagatacggtgaattaaagtcggcaataga 100 

I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I t I I M I I I I I I I I 
51 gcggagtcctcagccaggccgagatacggtgaattaaagtcggcaataga 100 

101 tttctgtgaaaATGTCAGTTCTTATATCACAGAGTGTGATAAATTATGTG 150 

I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
101 tttctgtgaaaatgtcagttcttatatcacagagtgtgataaattatgtg 150 

• • • . • * 

151 GAAGAAGAGAATATTCCTGCTCTGAAAGCTCTTCTTGAGAAGTGCAAAGA 200 

I I M I I I I I I I M I I I I I I I I I I i I I I I I I i I I I I I I 1 I I I I I I I I I I I i 
151 gaagaagagaatattcctgctctgaaagctcttcttgagaagtgcaaaga 200 

• > » • • 
201 TGTAGATGAGAGGAACGAGTGTGGCCAGACGCCGCTGATGCTGGCTGCGG 250 

1 1 1 1 i 1 1 i 1 1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 M 1 1 1 1 1 n I I I I I I i I I I I 

201 tgtagatgagaggaacgagtgtggccagacgccgctgatgctggctgcgg 250 

• • • • • • ' 
251 AGCAGGGCAATGTGGAAATAGTGAAAGAACTGTTAAAGAATGGAGCCAAC 300 

I I i I I I I I I I t i f i I I 1 I I I i ( I M I i I i I I f I I I I I I 1 I I I I I I I I I I I 
251 agcagggcaatgtggaaatagtgaaagaactgttaaagaatggagccaac 300 

■ • • • • 

301 TGCAATCTGGAAGATCTGGATAACTGGACTGCCCTTATATCGGCATCTAA 350 

I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i i I i I I I I 
301 tgcaatctggaagatctggataactggactgcccttatatcggcatctaa 350 

• • . • • . 

351 GGAGGGGCACATCCACATTGTGGAGGAGCTGCTGAAGAGCGGGGCCAGTC 4 00 

I I M i I i I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I 
351 ggaggggcacatccacattgtggaggagctgctgaagagcggggccagtc 400 

• • • • • 

4 01 TGGAGCACCGAGACATGGGTGGATGGACAGCTCTCATGTGGGCGTGCTAC 450 

i I M I I I I I I I I I i I M I I I I I M I i M I I I I I I I I I I M I I I I I I I I I i 
4 01 tggagcaccgagacatgggtggatggacagctctcatgtgggcgtgctac 450 

• • ■ • • 

451 AAAGGCAGGACCGACGTGGTCGAGCTGCTTCTTTCTCATGGTGCCAACCC 500 

I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
451 aaaggcaggaccgacgtggtcgagctgcttctttctcatggtgccaaccc 500 

• • • * • • 
501 AAGTGTCACTGGTTTGCAGTATAGTGTCTACCCAATCATCTGGGCAGCGG 550 

I I M I I i I i I I I I I I I I I I I M I I M I I I I I I i I I I I I I I I I I I I I I 
501 aagtgtcactggttt. . .gtatagtgtctacccaatcatctgggcagcgg 547 
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• • • • • 

551 GCAGAGGCCATGCCGACATAGTGCATCTTCTACTGCAGAATGGTGCGAAA 600 

I I I I I I I 1 I I I I I I I I II I I i I I I I I I I I I I I I I I I I I I I I I I I ! I M I I 
548 gqagaggccatgccgacatagtgGatcttctactgcag 597 

601 GTCAATTGCTCCGACAAGTACGGAACAACCCCCTTGGTATGGGCTGCCCG 650 

I III I I I HI I I I U I I I II I I I II I II. Ml II II II i I I I I I II I I I II 
598 gtcaattgGtccgacaagtacggaacaacccccttggtatgggctgcccg 647 

651 AAAGGGTCATTTGGAATGTGTGAAGCACTTGTTGGCCATGGGAGCTGATG 700 

I I I I I II H I II II II 111 I I I II I I II I I I I II I 1! I H i I M II M II 
648 aaagggtcatttggaatgtgtgaagcacttgttggccatgggagctgatg 697 

701 TTGATCAAGAAGGAGCCAATTCGACGACTGCACTTATCGTGGCAGTGAAA 750 

I I I I I II II II I I I I I I I I I I I II I I I I I I I I I I I i I I II I I I I I I I I I 
698 ttgatcaagaaggagccaattcgatgactgcacttatcgtggcagtgaaa 747 

•• • _ • • 

751 GGAGGCTACACACAGTCCGTGAAAGAGATTTTGAAZ\AGGAATCCAAATGT 800 

I I I I 1 M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II M I I I I I I 
748 ggaggctacacacagtccgtgaaagagattttgaaaaggaatccaaatgt 797 

- 

801 GAACCTAACTGACAAGGATGGAAACACAGCTCTGATGATCGCATCGAAGG 850 

I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
798 gaacctaactgacaaggatggaaacacagctctgatgatcgcatcgaagg 847 

851 AGGGACATATTGAGATTGTGCAGGACCTGCTGGACGCCGGAACATACGTG 900 

I M I I I I M I I I I I I I 11 I.I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I 
848 agggacatattgagattgtgcaggacctgctggacgccggaacatacgtg 897 

901 AACATACCTGACAGGAGTGGGGATACTGTATTGATCGGCGCTGTGAGAGG 950 

iMIIIIillllllllllllllllllllllllllllllllllllllllll 
8 98 aacatacctgacaggagtggggatactgtattgatcggcgctgtgagagg 947 

• • • . . 

951 TGGTCATGTTGAAATTGTTCGAGCACTTCTCCAAAAATATGCTGATATAG 1000 

N I I I I II I I I I i I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
948 tggtcatgttgaaattgttcgagcacttctccaaaaatatgctgatatag 997 

• • • • ' • 
1001 ACATTAGAGGGCAGGACAATAAGACCGCGCTGTATTGGGCCGTTGAAAAG 1050 

M I I I II I I I I I II I I I I I I I I I I I I I I I I I I I i I M I I I I I I I I I I I I I 
998 acattagagggcaggacaataagaccgcgctgtattgggccgttgaaaag 1047 

• • • • • 
1051 GGAAATGCGACAATGGTGAGGGATATCCTGCAGTGCAATCCGGACACGGA 1100 

I I I I I II i I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I II 
1048 ggaaatgcgacaatggtgagggatatcctgcagtgcaatccggacacgga 1097 

• • • • • 
1101 AATATGCACGAAGGATGGTGAGACACCACTGATAAAAGCCACCAAGATGA 1150 

I N I I II II II II II I I 11 II I I II I I II II II II I II 11 I II II I I I I I 
1098 aatatgcacgaaggatggtgagacaccactgataaaagccaccaagatga 1147 
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1151 GAAATACTGAAGTGGTGGAGCTGCTGCTGGATAAGGGGGCTAAGGTGTCT 1200 

I I I i I I I I i I i I I i i I I I f I I I I I I I I I I i I I I I I I i I I I I I I I i I I I I 
1148 gaaatattgaagtggtggagctgctgctggataagggggctaaggtgtct 1197 

• ■ • • ' • 
1201 GCTGTGGACAAGAAAGGGGACACTCCCCTGCACGTTGCCATCCGAGGGAG 1250 

I I I I I i I I II I I i I 11 I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
1198 gctgtggacaagaaaggggacactcccctgcacgttgccatccgagggag 1247 

• • • • • • 
1251 GAGCCGGAGACTGGCCGAACTTCTTTTACGGAACCCCA7\AGATGGACGGT 1300 

I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I M I I 
1248 gagccggagactggccgaacttcttttacggaaccccaaagatggacggt 1297 

■ • • • ■ 

1301 TACTCTACAGACCCAACAAAGCAGGCGAGACTCCCTACAACATTGACTGC 1350 

M II I I I i II I II II II II I I I I II I I II I I I I I I I I I II I 11 I I I I I I I 
1298 tactctacagacccaacaaagcaggcgagactccctacaacattgactgc 1347 

• • • • • 
1351 AGCCACCAGAAAAGCATTTTAACTCAAATATTTGGAGCCAGACACTTGTC 1400 

I I I I I I I I I I II II I I I II I I I II I I 1 II I I I I II I II II II I I II II I I 
1348 agccaccagaaaagcattttaactcaaatatttggagccagacacttgtc 1397 

• • - . . 
1401 TCCTACGGAAACAGATGGCGACATGCTTGGTTATGATTTGTATAGCAGTG 1450 

i I I I I I II i I I I 1 II I I I I I I t I I I I I I I I I I I I I I I I I I I I M I I 1 I II 
1398 tcctacggaaacagatggcgacatgcttggttatgatttgtatagcagtg 1447 

1451 CCCTGGCCGATATTCTCAGTGAACCCACCATGCAACCACCCATTTGTGTG 1500 

1 I II M II II II I I I II II II I I I I I I I I I II I II I II II II i I I I I I I I 
1448 ccctggccgatattctcagtgaacccaccatgcaaccacccatttgtgtg 1497 

• • • • . 
1501 GGCTTGTATGCACAGTGGGGAAGTGGGAAGTCTTTCTTACTCAAGAAACT 1550 

I I I I I I I I I I I I I I M II I I I I I I I I I I I I I II I I I I I M I I I I I M I II 
14 98 ggcttgtatgcacagtggggaagtgggaagtctttcttactcaagaaact 1547 

• • • • • « 
1551 AGAAGATGAGATGAAGACGTTTGCAGGACAGCAGACCGAGCCCCTCTTCC 1600 

I I I I II I I I I I I I I I I I I II II I I I 11 II I I I I II I I I I I I I I I I I I I I I 
1548 agaagatgagatgaagacgtttgcaggacagcagaccgagcccctcttcc 1597 

• * • • • 
1601 AGTTCTCGTGGCTCATAGTGTTCCTCACCCTGCTGCTGTGTGGCGGGCTC 1650 

M I I I I I I I I I II I I I I I II I I i I I I I II I I I I I I I I I I I I I I I I I I M I 
1598 agttctcgtggctcatagtgttcctcaccctgctgctgtgtggcgggctc 1647 

• • • . • 
1651 GGCCTAGTGTTTGCTTTCACGGTTGATACAAACCTCGCCATAGCCATTTC 1700 

M II 1 I I II II I II II I II II I II 111 j I II I M ) I II I I II 1 I II II 1) 
1648 ggcctagtgtttgctttcacggttgatacaaacctcgccatagccatttc 1697 

• • • • • 
1701 ATTGAGTTTCCTGGCTCTCATATATATATTCTTCATCGTCATTTACTTCG 1750 

I I I I I 1 I I I I I 1 I I I I i I I i I I I I I I I I I I )} I I I ) I I I I I I 1 I I I I I j I 
1698 attgagtttcctggctctcatatatatattcttcatcgtcatttacttcg 1747 

• • • • • 
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1751 GTGGACGGCGGGAAGGAGAGAGTTGGAACTGGGCCTGGGCTCTCAGTACC 1800 

i I I I I I I I I I i I I I I I i I I i I I I I I I i I I I I i I I I I I I I M I I I I I I I I I 
1748 gtggacggcgggaaggagagagttggaactgggcctgggctctcagtacc 1797 

• • • . • ' . 
1801 AGACTGGCAAGACACATTGGGTACCTGGAGCTCCTCTTCAAACTGATGTT 1850 

i I I I I III I I I I I I I II i II I I.I II I I II II M I I I M II II I I I i i I II 
1798 agactggcaagacacattgggtacctggagctcctcttcaaactgatgtt 1847 

1851 TGTGAACCCACCTGAACTTGCGGAGCAAACTACAAAGGCTTTACCTGTGA 1900 

i I I II I i II II i I I II II II I I i i I I I I I II I I I II I II II II N I j I 
1848 tgtgaacccacctgaactgccggagcaaactacaaaggctttacctgtga 1897 

• ■ • • • • 

1901 GGTTTTTGTTTACAGATTACAATAGACTCTCCAGTGTGGGAGGAGAAACT 1950 

I i i I I I I I I I I I I M I M I I M t I I I M I I I I M I I I I t I I I I i I I I I I I 
1898 ggtttttgtttacagattacaatagactctccagtgtgggaggagaaact 1947 

• • • • 

1951 TCCTTGGCTGAAATGATCGCGACCCTCTCAGATGCTTGTGAGAGAGAATT 2000 

I i i I M [ 1 1 i 1 1 i 1 1 1 M 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 

1948 tccttggctgaaatgatcgcgaccctctcagatgcttgtgagagagaatt 1997 
• 

2001 CGGCTTTTTGGCTACCCGTCTTTTTCGAGTCCTCAGGACTGAAGAGTCTC 2050 

I I I M I I I I M I I I I i I I i I I I I I i I I I I I I I I I M I I I I I t I M I I i I 
1998 cggctttttggctacccgtctttttcgagtcttcaggactgaagagtctc 2047 
• 

2051 AGGGTAAAAAGAAATGGAAAAGAACATGCTGCCTCCCCTCTTTTGTCATC 2100 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I 
2048 agggtaaaaagaaatggaaaaaaacatgctgcctcccctcttttgtcatc 2097 
• 

2101 TTTCTTTTTATTGTTGGCTGCATTATTGCTGGGATCACCCTTCTGGCTAT 2150 

I M I M I I I i I I i M I I M I I I I I I I I I I I I I I I I I I I I I M i I I I i I I I 
2098 tttctttttattgttggctgcattattgctgggatcacccttctggctat 2147 

2151 ATTCAGAGTTGACCCGAAACATCTGACAGTGAATGCTATCCTCATTTCTA 2200 

M I I i I I I I I I I I I I I I I i M I I I I I I I I I I I I I I I M I t 1 I I I I I I I I i 
2148 attcagagttgacccgaaacatctgacagtgaatgctatcctcatttcta 2197 

2201 TCGCATCTGTCGTGGGGTTGGCCTTCGTGCTCAATTGTCGAACATGGTGG 2250 

I I N I (11 I I f M I I I K I I I I I II I II II [ II I I M II II I I I I M I II 
2198 tcgcatctgtcgtggggttggccttcgtgctcaattgtcgaacatggtgg 2247 

2251 CAAGTGCTGGACTCTCTCCTGAATTCTCAGAGAAAACGCCTCCATAGTGC 2300 
I I I I I I I M I I I! I I I I I I I II I 11 I II I I I I I I I I I I I I I I I I I I I 11 I 

2248 caagtgctggactctctcctgaattctcagagaaaacgcctccatagtgc 2297 
• • • • . 

2301 TGCCTCCAAATTACACAAGTTAAAAAGTGAAGGATTCATGAAAGTCCTGA 2350 
I M I M It I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II 1 I 

2298 tgcctccaaattacacaagttaaaaagtgaaggattcatgaaagtcctga 2347 
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2351 AGTGTGAGGTGGAGCTGATGGCCAGGATGGCAAAAACCATTGACAGCTTC 2400 

I I I I I I I I I M I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2348 agtgtgaggtggagctgatggccaggatggcaaaaaccattgacagcttc 2397 

• • • • • 
2401 ACTCAAAACCAGACAAGGCTGGTGGTTATCATTGACGGCCTGGATGCCTG 2450 

I M I I I I i I I I i I I I I I I I I I I I I i i I I I I I I I I I I I I I I I I i M I I J I I 
2398 actcaaaaccagacaaggctggtggttatcattgacggcctggatgcctg 2447 

• • • • • 
2451 CGAGCAGGATAAAGTCCTTCAGATGCTGGACACTGTCCGAGTTCTGTTTT 2500 

I I I I I I I I ) I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2448 cgagcaggataaagtccttcagatgctggacactgtccgagttctgtttt 2497 

• • • ■ ■ 

2501 CAAAGGGCCCTTTTATTGCCATTTTTGCAAGTGATCCGCATATTATCATA 2550 

I I i I I I I I I I i 11 I I I I I i I I I I I M I I i I I I I i I I I I I I I I I I M I I I I 
2498 caaagggcccttttattgccatttttgcaagtgatccgcatattatcata 254 7 

• • • • • 
2551 AAGGCCATCAACCAGAACCTGAATAGTGTGCTTCGTGACTCAAATATAAA 2600 

I I I I I I I I I I I I I I I 1 I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I i I I I 
2548 aaggccatcaaccagaacctgaatagtgtgcttcgtgactcaaatataaa 2391 

• • • • • 

2 601 CGGACATGACTACATGCGCAATATAGTTCACTTACCAGTTTTTCTTAATA 2650 

I I I I I I M I I I I i 1 I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
2598 cggacatgactacatgcgcaatatagttcacttaccagtttttcttaata 2647 

• • • • • 
2651 GTCGTGGGCTTAGCAATGCAAGGAAATTCCTTGTAACTTCAGCAACAAAT 2700 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I i I I I I I I M I I I 
2648 gtcgtgggcttagcaatgcaaggaaattccttgtaacttcagcaacaaat 2697 

• • • • . 
2701 GGGGACATTACATGCTCAGACACCACAGGAACACAGGAGGACACTGACAG 2750 

I I i I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I 
2698 ggggacattacatgctcagacaccacaggaacacaggaggacactgacag 2747 

• - • . . 
2751 AAGAGTTTCACAGAACAGCCTTGGGGAGATGACAAAGCTTGGGAGCAAAA 2800 

i I M I M I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I M I I I M I I 
2748 aagagtttcacagaacagccttggggagatgacaaagcttgggagcaaaa 2797 

2801 CGGCTCTCAACAGAAGGGACACTTACCGCAGAAGACAGATGCAGAGAACC 2850 
I I I I i I M i I i I I I I I I i I I I I I I I i I I I I I I I I I I I I I I I i I I I I I I I I 

2798 cggctctcaacagaagggacacttaccgcagaagacagatgcagagaacc 284 7 

• • • 

2851 ATCACGAGGCAGATGTCCTTTGATCTCACAAAGCTGCTGGTCACTGAGGA 2900 
I M I I I I I I i i I M I I I I I I I I I I I I I I I I I I I I I I I 1 I I ) I I I I I I I 1 I 

2848 atcacgaggcagatgtcctttgatctcacaaagctgctggtcactgagga 2897 

• . • • . 
2901 TTGGTTCAGTGACATCAGCCCTCAGACCATGAGACGGCTACTCAATATTG 2950 

I M i I I I M 1 I I I I I I I I I I I I I I I I I I i I I I I I I I I I I i I M I i ( I I M 
2898 ttggttcagtgacatcagccctcagaccatgagacggctactcaatattg 2947 
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2951 TTTCTGTGACAGGGCGGCTACTAAGAGCCAATCAGATTACTTTCAACTGG 3000 

I M M I I I I i I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I 
294 8 tttctgtgacagggcggctactaagagccaatcagattactttcaactgg 2997 

... - • 
3001 GACAGGCTAGCCAGCTGGATCAACCTCACCGAGCAGTGGCCGTACCGGAC 3050 

t Nil i I i I I i I t I I M ( II I i I ( I I I I I I I II i ( I I M I I If I I I I I I I 
2998 gacaggctagccagctggatcaacctcaccgagcagtggccgtaccggac 3047 

• • . -■ . 

3051 GTCTTGGCTTATACTATATCTGGAAGAGACTGAAGGTCTCCCTGATCAGA 3100 

I i I I I i I I I M I M I II II II I II II II II I II II II I II II II II II II 
3048 gtcttggcttatactatatctggaagagactgaaggtctccctgatcaga 3097 

■ 

3101 TGACCTTAAAGACCATC.TATGAGAGAATATCGAAGAATATCCCAACAACT 3150 

i M I I I II II I i I I I I II I I I II II II II I II I I j 1 I I I I I t I I I I I I f I 
3098 tgaccttaaagaccatctatgagagaatatcgaagaatatcccaacaact 3147 

3151 AAAGATGTTGAGCCGCTTCTTGAAATTGATGGAGATATAAGAAACTTTGA 3200 

I I M I I I I I I II I I I Ml I I II I I I II I I I II I I I II I I I II I I I II I II 
3148 aaagatgttgagccgcttcttgaaattgatggagatataagaaactttga 3197 

3201 AGTCTTCTTGTCTTCAAGGACTCCAGTTCTCGTGGCTCGAGATGTAAAGA 3250 

M I i I I I I I i M I I I 1 I I I I II I I I II I I I I II I I M I I I I I I I I II I I I 
3198 agtcttcttgtcttcaaggactccagttctcgtggctcgagatgtaaaga 3247 

3251 CCTTTTTGCCGTGCACAGTAAACCTGGATCCCAAACTTCGGGAGACCATC 3300 

I M I I I I I I I I I I I I I I I I I I I I I M M I I I 11 I I I I I I I I I I I I I I I I 
3248 cctttttgccgtgcacagtaaacctggatcccaaacttcgggagatcatc 3297 
• 

3301 GCAGATGTCCGAGCTGCAAGAGAGCAGATCAACATAGGAGGCCTGGCCTA 3350 

I I I I I I M I M I I I I I t I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I 
3298 gcagatgtccgagctgcaagagagcagatcaacataggaggcctggccta 3347 
• 

3351 TCCCCCGCTCCCTCTGCACGAAGGCCCTCCTCGGCCACCTTCTGGGTACA 3400 

I i I I I I I I I I I i I I I I I I I I I I I I I i I I I I I I I i I I I I I I I I I I I I I I I I 
3348 tcccccgctccctctgcacgaaggccctcctcggccaccttctgggtaca 3397 

3401 i:.TCAGCCTGCATCTGTCTGCTCTTCAGCATCTTTCAACGGGCCGTTCCCA 3450 

NNIIIINIIIIIillllllllllllllllllllllllllllllllll 
3398 gtcagcctgcatctgtctgctcttcagcatctttcaacgggccgttccca 3447 

3451 GGCGGGGTTGTGTCTCCACAGCCCCACAGCAGTTACTACAGCGGCTTGTC 3500 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I i i I I I 
3448 ggcggggttgtgtctccacagccccacagcagttactacagcggcttgtc 3497 

3501 CGGACCCCAGCACCCCTTCTACAACAGGCCATTCTTTGCCCCATACCTTT 3550 

IIIIIIIIIMMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIII 
34 98 cggaccccagcaccccttctacaacaggccattctttgccccataccttt 3547 
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3551 ACACGCCAAGGTATTACCCTGGCGGTTCCCAACATCTCATCTCACGTTCA 3600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
3548 acacgccaaggtattaccctggcggttcccaacatctcatctcacgttca 3597 

• • • • • 

3601 TCAGT AAAAGCGAGTTTGCCCAGAGATCAGAACAATGGCCTACCGTGTGA 3650 

I M I I I I I I I I f I I I I I I I I I I I I I I I I I I i I I I I I I I i I I I I I i II I i 
3598 tcagtaaaaacgagtttgcccagagatcagaacaatggcctaccgtgtga 3647 

• • « • • 

3651 CTCTGGGTTTAACAAACAGAGACAGGGCAGCCGTCCCTGCCACAGGCAGC 3700 

I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I i I I I I I I I I I M I I I I 
3648 ctctgggtttaacaaacagagaca.ggcagccgtccctgccacaggcagc 3696 

• • ■ • • 

3701 TCACTGTTACTGAGTTCAATGACCGTGGATGTCGT.TGTGAGAAACTGAG 37 49 

11111111111111111111111111111111111 I I I I I I I I I I I I I I 
3697 tcactgttactgagttcaatgaccgtggatgtcgtatgtgagaaactgag 37 4 6 

• • • • • 

3750 ACAGATAGAAGGGCTGGACCAGAGCATGATGCCTCAGTACTGCACAACAA 37 99 

I I I I I I i I I i M I I I I I I I I I I I I I i I i I I M I I I I I I M I I I t i i i I i I 
3747 acagatagaagggctggaccagagcatgatgcctcagtactgcacaacaa 3;796 

• • • • • 

3800 TCAAAAAGGCGAACATAAATGGCCGGGTATTGTCTCAGTGTAACATTGAT 3849 

I I I I M ) I { I I I I 1 I I I I i I I I I I I i I I I I I t I I I I I I I I I I i i I I I I I I 
37 97 tcaaaaaggcgaacataaatggccgggtattgtctcagtgtaacattgat 384 6 

• • • • • 
3850 GAGCTGAAGAAAGAGATGGCTATGAACTTTGGAGACTGGCATCTCTTCAG 38 99 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I 
3847 gagctgaagaaagagatggctatgaactttggagactggcatctcttcag 3896 

• « ■ • • 

3900 AAGCATGGTCTTAGAAATGAGAAGTGTGGAAAGCCAGGTGGTCCCTGAAG 394 9 

I I I I I I I i I I I i 1 I I I I I I I I I I I I I I I I I I I i I I i i I I I I I I I I I I I M 
3897 aagcatggtctt agaaatgagaagt gtggaaagccaggtggtccctgaag 394 6 

• • • • • 
3950 ACCCTCGTTTTCTCAATGAAAACTCCAGTGCCCCAGTTCCTCATGGAGAG 3999 

I I I I I i I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I i I 
3947 accctcgttttctcaatgaaaactccagtgccccagttcctcatggagag 3996 

• • • • • 

4000 TCTGCTCGCCGCCCTTCTCACACTGAGCTGCCCCTCACGGAGCTTTGCAG 4 04 9 

I I I I I f i I I I I I I I I I I I I i I i I I I I 1 I I I I i I I I I I I I I M I I I I I I I 

3997 tctgctcgccgctcttctcacactgagctgcccctcacggagctttccag 404 6 

• • ■ • • 
4050 CCAGACTCCATACACCCTGAACTTCAGCTTTGAAGAACTGAACACGCTTG 4099 

I I I I I I I I i M I I I I I I I I M I i I I I I I I I I I I i I I I I I 1 I I i I I I M I I 
4 047 ccagactccatacaccctgaacttcagctttgaagaactgaacacgcttg 4096 

• • • • • 
4100 GCCTTGATGAGGGTGCCCCTCGTCACAGTAATCTAAGTTGGCAGTCACAA 414 9 

I I I I I I I I I I I I I Mini 1 1 i I 1 I II I II I II i I II I 1 II I I II 
. 4097 gccttgatgagggagcccctaggcacagtaacctgagttggcagtcacaa 4146 

• • • • « 
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4150 ACTCGCAGAACCCCAAGTCTCTCGAGTCTCAATTCCCAGGACTCCAGTAT 4199 
I I i I I I I I I i I I i I I i i I I I I I I I I I I I I I I I I I i I 1 I I I I I I I I I I I i I 

4147 actcgcagaaccccaagtctctcgagtctcaattcccaggactccagtat 4196 

• - . , , 

4200 TGAAATTTCAAAGCTTACTGATAAGGTGCAGGCCGAGTATAGAGACGCCT 424 9 
M I M I I I i i i I f i I I I I I I I I i I I I i I I I I i I I I I i I M I I I i I I I I I I 

4197 tgaaatttcaaagcttactgataaggtgcaggccgagtatagagacgcct 424 6 

4250 ATAGAGAGTACATTGCTCAGATGTCCCAGTTAGAAGGGGGCACAGGGTCG 4299 

M I I I I I I I M I I I I M I I I I I I I I f i I I i I i I I I I I I I I i I I M I I I i t 
4247 atagagagtacattgctcagatgtcccagttagaagggggcacagggtcg 4296 

4300 TCCACAATAAGCGGCAGATCTTCTCCACATAGCACATACTACATAGGCCA 434 9 

I I I I I I I I I I I i I 111 I I II II 11 I I I II I I I I i I I II II II I II II I II 
4297 tccacaataagcggcagatcttctccacatagcacatactacataggcca 434 6 

4350 GAGCTCTTCAGGGGGCTCCATCCATTCTACTCTAGAACAGGAAAGGGGGA 4399 

i M II M I II II II II II I II I I I M I III M II II II II I I II M 1 i I i 
4347 gagctcttcagggggctccatccattctactctagaacaggaaaggggga 4396 

4400 AGGAGGGTGAGCTAAAGCAGGAGGATGGGCGGAAGTCATTTTTAATGAAG 4449 

M M I I I I i I 1 I I I I I II i II I I I I I I II I i I I I I I 1 I II I I I I I I I I I I 
4397 aggagggtgagctaaagcaggaggatgggcggaagtcatttttaatgaag 4446 

4 450 AGGGGGGATGTCATAGATTACTCCTCATCAGGGGTGTCCACTAATGAGGC 4499 

I I I I I i I I II I II I i I I I II II I II 11 I II I t I II 11 I 11 I I i I I I II I I 
4447 aggggggatgtcatagattactcctcatcaggggtgtccactaatgaggc 4496 

* * • • 

4500 CTCACCCCTGGACCCCATTACTGAAGAAGATGAGAAATCCGACCAGTCAG 4549 

I I M M I I I I i i I i I I I I I I I I I I I I I I I I I I I i I I I I I I I I i 1 I I I I I I 
44 97 ctcacccctggaccccattactgaagaagatgagaaatccgaccagtcag 4546 
• 

4550 GTAGTAAGCTACTCCCAGGCAAGAAATCCTCAGAACGGCCCAGTCTCTTC 4599 
M I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4547 gtagtaagctactcccaggcaagaaatcctcagaacggcccagtctcttc 4596 

... 

4 600 CAGACAGACTTGAAGCTTAAGGGGGGTGGTCTTCGCTACCAGAAACTGCC 4 649 
I I I i I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4597 cagacagacttgaagcttaaggggggtggtcttcgctaccagaaactgcc 4 646 

... 

4 650 CAGTGATGAAGATGAGTCGGGGACAGAAGAGTCAGATAACACCCCACTGC 4 699 
M I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I M I I- 

4 647 cagtgatgaagatgagtcggggacagaagagtcagataacaccccactgc 4 696 

4700 TCAAAGATGATAAGGACAAAAAGGCTGAAGGCAAAGCAGAGAGAGTGTGC 4749 

I M I I I I I I I I I I I M I I I I I I I I f I I I I I I I I I I I I I I i I I I I I I I I I I 
4 697 tcaaagatgataaggacaaaaaggctgaaggcaaagcagagagagtgtgc 47 46 
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4750 AAGTCCCCAGAGCACAGTGCCGAGCCCATCAGAACGTTCATTAAAGCAftA 47 99 

I I I I I I I I I I I I I I I i i I I I I I I I I i I I I I I I I I I I I I I I i I I I I I U I i 
4747 aagtccccagagcacagt gccgagcccatcagaacgttcatt aaagcaaa 4796 

• • • • • 
4800 AGAGTATTTGTCCGATGCCCTCCTTGACAAAAAGGATTCCTCAGATTCGG 484 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I- 1 I I 
4797 agagtatttgtccgatgccctccttgacaaaaaggattcctcagattcgg 4846 

• • , ■ • • • 

4850 GAGTGAGGTCCAATGAGAGTTCTCCCAACCACTCTCTCCACAATGAGGCA 48 99 

I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I i I I I I I I I I I I I I I M I I 
4 847 gagtgaggtccaat gagagttctcccaaccact ct ctccacaatgaggca 4896 

• • • • • 

4 900 GCAGACGACTCCCAACTGGAAAAGGCATIACCTCATAGAGCTTGAAGATGA 4949 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I i I I I I I I I I I I I I i I I I I 
48 97 gcagacgactcccaactggaaaaggcaaacctcatagagcttgaagatga 494 6 

• • * ■ • 

4 950 GGGCCACAGTGGGAAGCGGGGGATGCCACACAGTCTGAGTGGCTTGCAAG 4 999 

I I I I I I I I I I i I I I I I I I I I I I i I I I I I I i I I I I I I I I I I i I I I I I I I I I 
4947 gggccacagtgggaagcgggggatgccacacagtctgagtggcttgcaag 4996 

• • • • • 
5000 ATCCAATTATAGCTCGGATGTCCATTTGCTCGGAAGACAAGAAAAGCCCT 504 9 

I I I I I 1 I I I I I I I I I M I i I I I I I I I I I I I I i I I I I I I I I i I I I I f I I I I 
4 997 atccaattatagctcggatgtccatttgctcggaagacaagaaaagccct 504 6 

• • • . • • 
5050 TCCGAGTGCAGCCTGATTGCTAGCAGCCCTGAAGAAAGCTGGCCTGCGTG 5099 

I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I M i i I I I I I I i I I I 
5047 tccgagtgcagcctgattgctagcagccctgaagaaagctggcctgcgtg 5096 

> • • • • 

5100 CCAGAAGGCCTACAATCTAAACCGAACGCCCAGCACTGTGACTCTGAACA 5149 

I I i I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I M I I I I I I I I I I 
5097 ccagaaggcctacaatctaaaccgaacgcccagcactgtgactctgaaca 5146 

- - • • • 

5150 ACAACACTGCACCCACCAACAGAGCCAATCAGAACTTCGATGAGATAGAG 5199 

I I I I I I f i I I I I I I I I i I i I I I I I i I I I I I I I I i I I I I I I I I I I I I I I I I 
514 7 acaacactgcacccaccaacagagccaatcagaacttcgatgagatagag 5196 

• • • * • 
5200 GGAATCAGGGAGACGTCCCAGGTCATTCTGAGGCCTGGCCCCAGTCCCAA 5249 

I I I 1 I I I I I I I I i I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
5197 ggaatcagggagacgtcccaggtcattctgaggcctggccccagtcccaa 524 6 

• • • • • 
5250 CCCAACTGCTGTGCAGAATGAAAACCTGAAGAGCATGGCGCACAAGCGAA 5299 

M I I M I i I I I I M I I I I I I I i M I I I I I I I I I M I I I II I I M I I I M I 
5247 cccaactgctgtgcagaatgaaaacctgaagagcatggcgcacaagcgaa 52 96 

• • « . • 
5300 GCCAGCGCTCCAGTTACACAAGGCTCTCGAAAGATGCATCTGAGCTACAT 5349 

I I I I I I I I I t I I I I I I 1 i M I I I i I I I I I I I I I I I I I i I I 1 I I I I I I I I I 
5297 gccagcgctccagttacacaaggctctcgaaagatgcatctgagctacat 5346 
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5350 GCAGCCTCCTCGGAGAGCACAGGCTTTGGAGAAGAGAGAGAGAGCATTCT 5399 

I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I i I I I I I I I M I I I I 
5347 gcagcctcctcggagagcacaggctttggagaagagagagagagcattct 5396 

• • • • , 

5400 C. . . tgaggaaggagaagtgaaaagagtgtgacagactttgagttgacca 5446 

I I I I I I I I I I M i I I I I i I I I I I I I I I I I I I I I i I I I I I I I I M I I I 
5397 ctgatgaggaaggagaagtgaaaagagtgtgacagactttgagttgacca 54 4 6 

• " • • • 
5447 ttcctcattactcatcatgtttatgcatcacagacacctgataggtggtc 54 96 

I I I I I I i I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
544 7 ttcctcattactcatcatgtttatgcatcacagacacctgataggtggtc 54 96 

5497 aggcactggaagcagacaggaaagcctgaaagcacagtcaccagtcattt 554 6 
lllilllllillllllllllllllllllllllllllllllllllllllll 

5497 aggcactggaagcagacaggaaagcctgaaagcacagtcaccagtcattt 5546 

• • • 

5547 tcattgtcaccgtcatcccaggcactttaaagtctcactgccactattac 5596 
I I I I I I i I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I M I I I I 

5547 tcattgtcaccgtcatcccaggcactttaaagtctcactgccactattac 5596 

5597 acctttcctggcacacagctaggcagagcgatgagagcaggctgaagaca 564 6 

I I i i I i I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
5597 acctttcctggcacacagctaggcagagcgatgagagcaggctgaagaca 564 6 

• - . . 
5647 aaataggtcaaaagtgtatttaaagttgaaagggcttaaatcacaaggaa 5696 

i I I M I I I I I I I I I I i I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
5647 aaataggtcaaaagtgtatttaaagttgaaagggcttaaatcacaaggaa 5696 

• " • • • 
5697 aaaaacttagagatctcctttataaactattcatgtaccatttgagggaa 5746 

I I i I I I I i I i I I I I I I I I I I I I I I I I I I M I I I I i M I I I i I I I I i I I I I 
5697 aaaaacttagagatctcctttataaactattcatgtaccatttgagggaa 5746 
• 

5747 gatacagtgttagatgaagcagaaaccaattttattgttagtgtttaatc 5796 

IMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIilllMllllllll 
5747 gatacagtgttagatgaagcagaaaccaattttattgttagtgtttaatc 5796 

• • 

5797 ttggtgcagatttataagttttagagtagcccagagactaaaagtgaata 5846 
I I I I I I I M I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

57 97 ttggtgcagatttataagttttagagtagcccagagactaaaagtgaata 5846 

• • . . 
5847 cttagcaaatggatagccagtgttctatataggaatcattgcttttcaga 5896 

M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
5847 cttagcaaatggatagccagtgttctatataggaatcattgcttttcaga 5896 
• 

5897 gggcttaaagtttaagtagaaaatatatactcaagaaggcgataaaagct 5946 
I i I M I I I I I I I I I i I I I I I i I I I I I I I M I I I I I I I I I I I I I I I I I I i I 

58 97 gggcttaaagtttaagtagaaaatatatactcaagaaggcgataaaagct 594 6 
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5947 gatgagaaagtgagttagcagaacccaaagccgtgctgggccgcggtgacV'5996 

I I II i M I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I 
5947 gat gagaaagtgagtt agcagaacccaaagccgtgct gggccgcggtgac 5996 

• • • ■ • 

5997 tcattagcagaggaggaggggagggcagtatattcctgggatactctctc 604 6 

I I I I II I I i II I I I I I I I I I I I I I II I II I I I I I II II I I II II I I I I II 
5997 tcattagcagaggaggaggggagggcagtatattcctgggatactctctc 6046 

■ _ • • • • 

6047 cagacccagcctggcttctgacatcatccacctgtgccctcaaaaccgtc 6096 

I II I I II I I I I I I II I I II I I II i I II I II I M M II I I I II I I II I I I I 
6047 cagacccagcctggcttctgacatcatccacctgtgccctcaaaaccgtc 60 96 

• • • • • 
6097 ttagtctgttctgcaactcttaagtgacatacctaactcagctcatggct 6146 

I I I I I 1 I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
6097 ttagtctgttctgcaactcttaagtgacatacctaactcagctcatggct 614 6 

6147 aaggaaaaaaaattaaagttgtcctggtgattaaactctggacctcccac 6196 

II I II I I II I I II i I I I I I I I I I I i I I I I I I I I I II I I I II I I I I I I I I I 
6147 aaggaaaaaaaattaaagttgtcctggtgattaaactctggacctcccac '6196 

6197 atctaagtcccgagttgacaaactgcatccccagcccaaacatctgttct 6246 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I 
6197 atctaagtcccgagttgacaaactgcatccccagcccaaacatctgttct 6246 

6247 tccagagatctggaaagacatgggcataagagtccagtaaactcttaatg 6296 

I I I II I I I I I II I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I 
6247 tccagagatctggaaagacatgggcataagagtccagtaaactcttaatg 6296 

• • • • • 

62 97 aaggtgtcgggaagtgtttcccttaaaggggttagtgacagtgacactag 6346 

I I II I II II I II I I I I I I I I I I I I I I I II I II I I I I II I Ml II II I I II 
6297 aaggtgtcgggaagtgtttcccttaaaggggttagtgacagtgacactag 634 6 

• • • • • 
6347 acaagtgacagaatttagtttgctgacattaaatcactgccttctgggcc 6396 

I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
6347 acaagtgacagaatttagtttgctgacattaaatcactgccttctgggcc 6396 

• • ■ • • 
6397 gcaatagtttagtatctgttcctaaatcagcatttgagttctgcgggggt 644 6 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I 
6397 gcaatagtttagtatctgttcctaaatcagcatttgagttctgcgggggt 64 4 6 

• • • ■ • 
6447 tacaaaatggaggcatggttttaagaaggaatttgaaagcatccagggaa 6496 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
64 47 tacaaaatggaggcatggttttaagaaggaatttgaaagcatccagggaa 64 96 

• • • • « 
6497 gcaagaacagagtcggagctggtgtgaccatgacacttcccattattgtt 654 6 

I I I I M I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
64 97 gcaagaacagagt cggagctggtgtgaccatgacactt cccattattgtt 654 6 
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6547 agcaaagctaggagaatgaccacatacatttaagtatcatgtaggctaga 6596 

I I I i 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I 
6547 agcaaagct aggagaatgaccacat acattt aagtatcatgt aggctaga 6596 

• • • • • 
6597 gcgtgcctgtcatcatcatgcattgagaccgtttgtcctttgattatagt 664 6 

I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I i I I I I I I I I I 
6597 gcgtgcctgtcatcatcatgcattgagaccgtttgtcctttgattatagt 664 6 

• • • • • 
6647 gtgtttatgtagaccagtcctagcagtatgtcctgagccatgggcaggat 669 6 

I I I i I I I I I I I I i I I I I I I I I I I I I M I I I I I I I I I I I I I I i I I I I I I I I 
6647 gtgtttatgtagaccagtcctagcagtatgtcctgagccatgggcaggat 6696 

• • - • • • 
6697 tgtagatggcgaggtggcaggtttagtgctcgttgagatgtccatctctg 674 6 

1 I I i I I I I ! I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I 
6697 tgtagatggcgaggtggcaggtttagtgctcgttgagatgtccatctctg 674 6 

• • • • ■ 
6747 actggacctgtgtggtttacttgtagctgctccgtgaaacactgccttgc 6796 

I I I I I I i i I I I 1 I I I i I i I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I 
6747 actggacctgtgtggtttacttgtagctgctccgtgaaacactgccttgc 6796 

• • • • • 
6797 tgtttgagtgtaattgagcattttcatgaagtgtatatgttgcatgatct 684 6 

I i I I I I I I I I I I I I I I I I I I i I I I I I i I I I i I I I I I I I I I I I I I I I I I I I 
6797 tgtttgagtgtaattgagcattttcatgaagtgtatatgttgcatgatct 684 6 

• • • • • 
6847 ttaacaggtatttttagacattaagaacagcatgatattactgacatatg 6896 

i I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I i I I I I I I I I 
6847 ttaacaggtatttttagacattaagaacagcatgatattactgacatatg 6896 

• • • • • 
6897 cattagtagtatgcagtatcatgtaatgggtacttcagaggcagtttggt 6946 

i I I I I I I i i I i I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I 
6897 cattagtagtatgcagtatcatgtaatgggtacttcagaggcagtttggt 6946 

• • • , 

6947 ttctttccaggtgagtagttatagctcacagtcatgggaatcctttattc 6996 

I M f I i i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I i I I I I I 
6947 ttctttccaggtgagtagttatagctcacagtcatgggaatcctttattc 6996 

■ • • • . 

6997 acactgtctgtgaagtcgatgctggagtgccaaatgcacacggagactag 704 6 

I M M M I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I 
6997 acactgtctgtgaagtcgatgctggagtgccaaatgcacacggagactag 7046 

• • ' • • • 
7047 gaacaataaagacttgcatgccattctgactcatggagttttattcaata 7096 

I I I I I I i I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7047 gaacaataaagacttgcatgccattctgactcatggagttttattcaata 7096 

• • • • 

7097 gagcaatgataattcccattctgattaaaaattaaaacacggaa 7140 

I I M M I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 
7097 gagcaatgataattcccattctgattaaaaattaaaacacggaa 7140 
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1 MSVLISQSVINYVEEENIPALKALLEKCKDVDERNECGQTPLMLAAEQGN 50 

I I I I I I I I I I I I I I I I I I i t I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I 
1 MSVLISQSVINYVEEENIPALJCALLEKCKDVDERNECGQTPLMLAAEQGN 50 

• • • ■ « 

51 VEIVKELLKNGANCNLEDLDNWTALISASKEGHIHIVEELLKSGASLEHR 100 

I I I I I I I I I i I I I I I I I I I i I I I i I I I I I I I I I I i I I I I I I I I I I I I I I I 
51 VEIVKELLKNGANCNLEDLDNWTALISASKEGHIHIVEELLKSGASLEHR 100 

> • • • • 

101 DMGGWTALMWACYKGRTDWELLLSHGANPSVTGLQYSVYPIIWAAGRGH 150 

I I i I I 1 I I I I I I I I I I I I I I I I I M I I I I I I I I I I I i I I I I I I I I I I I I 
101 DMGGWTALMWACYKGRTDWELLLSHGANPSVTGL. YSVYPIIWAAGRGH 149 

• • • • • 

151 ADIVHLLLQNGAKVNCSDKYGTTPLVWAARKGHLECVKHLLAMGADVDQE 200 

I i I I I i I I I I I I I I I I I I I t I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I 
150 ADIVHLLLQNGAKVNCSDKYGTTPLVWAARKGHLECVKHLLAMGADVDQE 199 

• • • • » 

201 GANSTTALIVAVKGGYTQSVKEILKRNPNVNLTDKDGNTALMIASKEGHI 250 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I i I I I i I I I I I I I I I I I I 
200 GANSMTALIVAVKGGYTQSVKEILKRNPNVNLTDKDGNTALMIASKEGHI 249 

■ • • • ■ • 

251 EIVQDLLDAGTYVNIPDRSGDTVLIGAVRGGHVEIVRALLQKYADIDIRG 300 

I I I I I I I i I I I I I I I I I I I ! I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I 
250 EIVQDLLDAGTYVNIPDRSGDTVLIGAVRGGHVEIVRALLQKYADIDIRG 299 

• • • . . 

301 QDNKTALYWAVEKGNATMVRDILQCNPDTEICTKDGETPLIKATKMRNTE 350 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I i I I I I I 
300 QDNKTALYWAVEKGNATMVRDILQCNPDTEICTKDGETPLIKATKMRNIE 349 

• • - • • 

351 WELLLDKGAKVSAVDKKGDTPLHVAIRGRSRRLAELLLRNPKDGRLLYR 400 

I I 1 I I I I I I I I I i I I I I I I I I I I i I I I I I I I I I I I I I I I i i I I I I I I I I I 
350 WELLLDKGAKVSAVDKKGDTPLHVAIRGRSRRLAELLLRNPKDGRLLYR 399 

• • • • • 

4 01 PNKAGETPYNIDCSHQKSILTQIFGARHLSPTETDGDMLGYDLYSSALAD 4 50 

I I i I I I I I I I I I I I I I i I I I I I I I I i I i I I I I I M I I I I I i I I I I I I I I I 
400 PNKAGETPYNIDCSHQKSILTQIFGARHLSPTETDGDMLGYDLYSSALAD 4 49 

• • • • • 

4 51 ILSEPTMQPPICVGLYAQWGSGKSFLLKKLEDEMKTFAGQQTEPLFQFSW 500 

I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
450 ILSEPTMQPPICVGLYAQWGSGKSFLLKKLEDEMKTFAGQQTEPLFQFSW 4 99 

• • • . • 

501 LIVFLTLLLCGGLGLVFAFTVDTNLAIAISLSFLALIYIFFIVIYFGGRR 550 

I I i I I I i I i i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I 
500 LIVFLTLLLCGGLGLVFAFTVDTNLAIAISLSFLALIYIFFIVIYFGGRR 549 



76/79 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 0220786A2_L> 



wo 02/20786 



PCT?GB01/03977 



Figure 25 (page 2 of 4) 

551 EGESWNWAWALSTRLARHIGYLELLFKLMFVNPPELAEQTTKALPVRFLF 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
550 EGESWNWAWALSTRLARHIGYLELLFKLMFVNPPELPEQTTKALPVRFLF 599 

601 TDYNRLSSVGGETSLAEMIATLSDACEREFGFLATRLFRVLRTEESQGKK 650 

I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
600 TDYNRLSSVGGETSLAEMIATLSDACEREFGFLATRLFRVFRTEESQGKK 649 

• ' ■ • • . • 

651 KWKRTCCLPSFVIFLFIVGCIIAGITLLAIFRVDPKHLTVNAILISIASV 700 

I I I : I I i I I I I I I I I I I I I I I I I I I i I I I I I I I I I i I I I I I I M I I M I I 
650 KWKKTCCLPSFVIFLFIVGCIIAGITLLAIFRVDPKHLTVNAILISIASV 699 

• • • • • 

701 VGLAFVLNCRTWWQVLDSLLNSQRKRLHSAASKLHKLKSEGFMKVLKCEV 750 

I I I I I I I I I I i I i I I I I I I I i I I I 11 I M I I I M I I I I I I I I I I I I I i I I 
700 VGLAFVLNCRTWWQVLDSLLNSQRKRLHSAASKLHKLKSEGFMKVLKCEV 749 

• ■ • • • 

751 ELMARMAKTIDSFTQNQTRLWIIDGLDACEQDKVLQMLDTVRVLFSKGP 800 

I I I I I I I I I i M I I I I I I I I i I I I I I I I I I I I I I I I I i I I I I I I I I I I I I 
750 ELMARMAKTIDSFTQNQTRLWIIDGLDACEQDKVLQMLDTVRVLFSKGP 799 

• • • • • 
801 FIAIFASDPHIIIKAINQNLNSVLRDSNINGHDYMRNIVHLPVFLNSRGL 850 

I I I I I M I I I I I I I I I i i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
800 FIAIFASDPHIIIKAINQNLNSVLRDSNINGHDYMRNIVHLPVFLNSRGL 849 

• • • ■ • 

851 SNARKFLVTSATNGDITCSDTTGTQEDTDRRVSQNSLGEMTKLGSKTALN 900 
I I ) I i I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I f I I I I I I I I I I I I I I 

850 snarkflvtsatngditcsdttgtqedtdrrvsqnslge;mtklgsktaln 899 

• • • • • 

901 RRDTYRRRQMQRTITRQMSFDLTKLLVTEDWFSDISPQTMRRLLNIVSVT 950 

1 1 1 I I M 1 1 I 1 1 1 1 1 1 1 1 I 1 1 1 1 I I I I i 1 1 1 1 1 i 1 1 1 1 1 I i 1 1 1 1 1 1 I 1 1 

900 RRDTYRRRQMQRTITRQMSFDLTKLLVTEDWFSDISPQTMRRLLNIVSVT 949 

• • • > • 

951 GRLLRANQITFNWDRLASWINLTEQWPYRTSWLILYLEETEGLPDQMTLK 1000 

I I I t I I I I I i i i I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
950 GRLLRANQITFNWDRLASWINLTEQWPYRTSWLILYLEETEGLPDQMTLK 999 

• • • • • 

1001 TIYERISKNIPTTKDVEPLLEIDGDIRNFEVFLSSRTPVLVARDVKTFLP 1050 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1000 TIYERISKNIPTTKDVEPLLEIDGDIRNFEVFLSSRTPVLVARDVKTFLP 1049 

• • • • • 

1051 CTVNLDPKLRETIADVRAAREQINIGGLAYPPLPLHEGPPRPPSGYSQPA 1100 

I I I i I I I I I I I I I I I I I I I M I I I I I I I I I I I I I i I I I I i I I I I j I I I I 
1050 CTVNLDPKLREIIADVRAAREQINIGGLAYPPLPLHEGPPRPPSGYSQPA 1099 

• • - • • 

1101 SVCSSASFNGPFPGGVVSPQPHSSYYSGLSGPQHPFYNRPFFAPYLYTPR 1150 

i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i i I i I i 1 1 i I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1100 SVCSSASFNGPFPGGWSPQPHSSYYSGLSGPQHPFYNRPFFAPYLYTPR 1149 

• • - • • 
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1151 YYPGGSQHLISRSSVKASLPRDQNNGLPCDSGFNKQRQGSRPCHRQ]1.TVT -1200 

I I I I I I I I I I I I i I I I M I I I I I I I I I I I I I I I I I I I . i 
1150 YYPGGSQHLISRSSVKTSLPRDQNNGLPCDSGFNKQRQAAVPATGSSLLL 1199 

1201 EFNDRGCRCEKLRQIEGLDQSMMPQYCTTIKKANINGRVLSQCNIDELKK 1250 

I I i I i I I I i I I I I I I I I I I i i I I I I I I I I I I I I I i I i I I I I I 
1200 SSMTVDWCEKLRQIEGLDQSMMPQYCTTIKKANINGRVLSQCNIDELKK 1249 

1251 EMAMNFGDWHLFRSMVLEMRSVESQWPEDPRFLNENSSAPVPHGESARR 1300 

I I I I i I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I 
1250 EMAMNFGDWHLFRSMVLEMRSVESQWPEDPRFLNENSSAPVPHGESARR 1299 

• • • • • 

1301 PSHTELPLTELSSQTPYTLNFSFEELNTLGLDEGAPRHSNLSWQSQTRRT 1350 

I I I I I I I I I I I I I I I I I I I i I I I I I I I t I I I I I I I I I I I I I I I I I I i I I 
1300 SSHTELPLTELSSQTPYTLNFSFEELNTLGLDEGAPRHSNLSWQSQTRRT 1349 

• * m m m 

1351 PSLSSLNSQDSSIEISKLTDKVQAEYRDAYREYIAQMSQLEGGTGSSTIS 1400 

I I I I I I I I I I I I I I I I I I N I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 
1350 PSLSSLNSQDSSIEISKLTDKVQAEYRDAYREYIAQMSQLEGGTGSSTIS 1399 

• • • • • 

1401 GRSSPHSTYYIGQSSSGGSIHSTLEQERGKEGELKQEDGRKSFLMKRGDV 1450 

I I I I I I I I I I I I i i I I I I I I I I I I I I I I i I I I I I I I I I I i I I I I I i I I I I 
1400 GRSSPHSTYYIGQSSSGGSIHSTLEQERGKEGELKQEDGRKSFLMKRGDV 1449 

• • * • • 

1451 IDYSSSGVSTNEASPLDPITEEDEKSDQSGSKLLPGKKSSERPSLFQTDL 1500 

I I I I M I I I I I I I I I i i I I I I I I I I M I I i I I I I I I I I I I I I i I I i I I I I 
1450 IDYSSSGVSTNEASPLDPITEEDEKSDQSGSKLLPGKKSSERPSLFQTDL 1499 

1501 KLKGGGLRYQKLPSDEDESGTEESDNTPLLKDDKDKKAEGKAERVCKSPE 1550 

I I 1 I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I 
1500 KLKGGGLRYQKLPSDEDESGTEESDNTPLLKDDKDKKAEGKAERVCKSPE 1549 

• • • • • 

1551 HSAEPIRTFIKAKEYLSDALLDKKDSSDSGVRSNESSPNHSLHNEAADDS 1600 

I I I I I I I I I .1 I I I I I I I I I I I I I I i I I i I I I I I I I I I I I I I I I I I I I I I I 
1550 HSAEPIRTFIKAKEYLSDALLDKKDSSDSGVRSNESSPNHSLHNEAADDS 1599 

* • • • • • 

1601 QLEKANLIELEDEGHSGKRGMPHSLSGLQDPIIARMSICSEDKKSPSECS 1650 

I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1600 QLEKANLIELEDEGHSGKRGMPHSLSGLQDPIIARMSICSEDKKSPSECS 1649 

• • ■ • • 
1651 LIASSPEESWPACQKAYNLNRTPSTVTLNNNTAPTNRANQNFDEIEGIRE 1700 

I I I I I i I I I I I I I I I I I I I I i I I I I I I I I i I I I I I I I I I I I I I I I I I I I I 
1650 LIASSPEESWPACQKAYNLNRTPSTVTLNNNTAPTNRANQNFDEIEGIRE 1699 

• • • * • 
1701 TSQVILRPGPSPNPTAVQNENLKSMAHKRSQRSSYTRLSKDASELHAASS 1750 

I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
17 00 TSQVILRPGPSPNPTAVQNENLKSMAHKRSQRSSYTRLSKDASELHAASS 1749 
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1751 ESTGFGEERESIL 
I I I I I I I I I I I I I 
1750 ESTGFGEERESIL 



Figure 25 (page 4 of 4) 

1763 
1762 
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Continuation of Box I.l 

Although claims 98, 99 and 114 and claim 100 as far as it refers to 
claims 98-99 are directed to a method of treatment of the human/animal 
body, the search has been carried out and based on the alleged effects of 
the compound/composition. 

Although claims 112-113 are directed to a diagnostic method practised on 
the human/animal body, the search has been carried out and based on the 
alleged effects of the compound/composition. 



Continuation of Box I.l 



Rule 39.1(iv) PCT - Method for treatment of the human or animal body by 

therapy 

Rule 39.1(iv) PCT - Diagnostic method practised on the human or animal 
body 



Continuation of Box 1.2 



Present claims 67, 74-76, 79-81, 101, 102 and 104 relate to 
compounds/methods defined by reference to a desirable characteristic or 
property, namely 

Claim 67 : method for modulating the function of the polypeptide of 
claims 33-42 by reducing the ability of said polypeptide to bind ATP; 
Claims 74- 76 and 79-81 (as far as they refer to claims 68-72): an agent 
Identifiable by the method of claims 68-72; 

Claims 101 and 102: as far as they relate to an agent capable of 
Inhibiting the function of the K1d1ns220 gene or its products, not being 
an antibody according to claim 51-61 or an antisense polynucleotide to 
the polynucleotides oif claims 1-10, 17, 20, or 22; 

Claim 104: use of an agent which is capable of detecting the expression 
of the K1d1ns220 gene. 

The claims cover all compounds/methods having the Indicated 
characteristics or properties, whereas the application provides support 
within the meaning of Article 6 PCT and/or disclosure within the meaning 
of Article 5 PCT for only a very limited number of such 
products/compounds/methods/apparatus. In the present case, the claims so 
lack support, and the application so lacks disclosure, that a meaningful 
search over the whole of the claimed scope Is impossible. Independent of 
the above reasoning, the claims also lack clarity (Article 6 PCT). An 
attempt is made to define the cmpounds/methods by reference to a result 
to be achieved. Again, this lack of clarity In the present case Is such 
as to render a meaningful search over the whole of the claimed scope 
impossible. Consequently, the search has been carried out for those parts 
of the claims which appear to be clear, supported and disclosed, namely 
those parts relating to antibodies and antisense molecules of the claimed 
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polypeptides and polynucleotides. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to Inventions In respect of which no international 
search report has been established need not be the subject of an 
International preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case Irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 



BNSDOCID: <WO 0220786A3_I_> 



